SIMIAN ADENOVIRUS NUCLEIC ACID AND AMINO ACID SEQUENCES,
VECTORS CONTAINING SAME, AND METHODS OF USE

BACKGROUND OF THE INVENTION

Adenovirus is a double-stranded DNA virus with a genome size of about 36 
kilobases (kb), which has been widely used for gene transfer applications due to its ability 
to achieve highly efficient gene transfer in a variety of target tissues and large transgene 
capacity. Conventionally, E1 genes of adenovirus are deleted and replaced with a
transgene cassette consisting of the promoter of choice, cDNA sequence of the gene of
interest and a poly A signal, resulting in a replication defective recombinant virus.

Adenoviruses have a characteristic morphology with an icosahedral capsid
consisting of three major proteins, hexon (II), penton base (III) and a knobbed fibre (IV),
along with a number of other minor proteins, VI, VIII, IX, IIIa and IVa2 [W.C. Russell, J.
Gen. Virol., 81:2573-2604 (Nov 2000)]. The virus genome is a linear, double-stranded
DNA with a terminal protein attached covalently to the 5' termini, which have inverted
terminal repeats (ITRs). The virus DNA is intimately associated with the highly basic
protein VII and a small peptide termed mu. Another protein, V, is packaged with this
DNA-protein complex and provides a structural link to the capsid via protein VI. The
virus also contains a virus-encoded protease, which is necessary for processing of some of
the structural proteins to produce mature infectious virus.

Recombinant adenoviruses have been described for delivery of molecules to host 
cells. See, US Patent 6,083,716, which describes the genome of two chimpanzee 
adenoviruses. 

What is needed in the art are more effective vectors which avoid the effect of
pre-existing immunity to selected adenovirus serotypes in the population and/or which are
useful for repeat administration and for titer boosting by second vaccination, if required.

Summary of the Invention

The present invention provides the isolated nucleic acid sequences and amino acid
sequences of six simian adenoviruses, vectors containing these sequences, and cell lines
expressing simian adenovirus genes. Also provided are a number of methods for using
the vectors and cells of the invention.

The methods of the invention involve delivering one or more selected
heterologous gene(s) to a mammalian patient by administering a vector of the invention.
Because the various vector constructs are derived from simian rather than from human
adenoviruses, the immune system of the non-simian human or veterinary patient is not
primed to respond immediately to the vector as a foreign antigen. Use of the
compositions of this invention thus permits a more stable expression of the selected
transgene when administered to a non-simian patient. Use of the compositions of this
invention for vaccination permits presentation of a selected antigen for the elicitation of
protective immune responses. Without wishing to be bound by theory, the ability of the
adenoviruses of the invention to transduce human dendritic cells is at least partially
responsible for the ability of the recombinant constructs of the invention to induce an
immune response. The recombinant simian adenoviruses of this invention may also be
used for producing heterologous gene products in vitro. Such gene products are
themselves useful for a variety of purposes such as are described herein.

These and other embodiments and advantages of the invention are described in
more detail below.



Brief Description of the Drawings

Fig. 1 provides an alignment of the amino acid sequences of the L1 and a portion
of the L2 loops of the capsid protein hexon of the chimpanzee adenovirus C1 [SEQ ID
NO: 13], chimpanzee adenovirus C68 (Pan-9) [SEQ ID NO: 14], and the novel Pan5 [SEQ
ID NO: 15], Pan6 [SEQ ID NO: 16] and Pan7 [SEQ ID NO: 17] chimpanzee adenovirus
sequences of the invention. The intervening conserved region is part of the pedestal
domain conserved between adenovirus serotypes.

Fig. 2 provides an alignment of the amino acid sequences of the fiber knob
domains of chimpanzee C68 (Pan-9) [SEQ ID NO: 18], Pan-6 [SEQ ID NO: 19], Pan-7
[SEQ ID NO: 20], and Pan-5 [SEQ ID NO: 21] and the human adenovirus serotypes 2
[SEQ ID NO: 22] and 5 [SEQ ID NO: 23].

DETAILED DESCRIPTION OF THE INVENTION

The invention provides novel nucleic acid and amino acid sequences from Ad
Pan5 [SEQ ID NO: 1-4, 15 and 21], Ad Pan6 [SEQ ID NO: 5-8, 16, 19], and Ad serotype
Pan7 [SEQ ID NO: 9-12, 17, 20], which were originally isolated from chimpanzee lymph
nodes. In several instances throughout the specification, these adenoviruses are
alternatively termed herein C5, C6 and C7, respectively. Also provided are sequences
from adenovirus SV1 [SEQ ID NO: 24-28], which was originally isolated from the
kidney cells of cynomolgus monkey. The invention also provides sequences of
adenoviruses SV-25 [SEQ ID NO: 29-33] and SV-39 [SEQ ID NO: 34-37], which were
originally isolated from rhesus monkey kidney cells.

The present invention provides novel adenovirus vectors and packaging cell lines
to produce those vectors for use in the in vitro production of recombinant proteins or
fragments or other reagents. The invention further provides compositions for use in
delivering a heterologous molecule for therapeutic or vaccine purposes. Such therapeutic
or vaccine compositions contain the adenoviral vectors carrying an inserted heterologous
molecule. In addition, novel sequences of the invention are useful in providing the
essential helper functions required for production of recombinant adeno-associated viral
(AAV) vectors. Thus, the invention provides helper constructs, methods and cell lines
which use these sequences in such production methods.

The term "substantial homology" or "substantial similarity," when referring to a
nucleic acid or fragment thereof, indicates that, when optimally aligned with appropriate
nucleotide insertions or deletions with another nucleic acid (or its complementary strand),
there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences.

The term "substantial homology" or "substantial similarity," when referring to
amino acids or fragments thereof, indicates that, when optimally aligned with appropriate
amino acid insertions or deletions with another amino acid sequence,
there is amino acid sequence identity in at least about 95 to 99% of the aligned sequences.
Preferably, the homology is over full-length sequence, or a protein thereof, or a fragment
thereof which is at least 8 amino acids, or more desirably, at least 15 amino acids in
length. Examples of suitable fragments are described herein.

The term "percent sequence identity" or "identical" in the context of nucleic acid
sequences refers to the residues in the two sequences that are the same when aligned for
maximum correspondence. The sequence identity comparison is desirably made over the
full length of the genome (e.g., about 36 kbp), the full length of an open reading frame of
a gene, protein, subunit, or enzyme [see, e.g., the tables providing the adenoviral coding
regions], or a fragment of at least about 500 to 5000 nucleotides. However,
identity among smaller fragments, e.g., of at least about nine nucleotides, usually at least
about 20 to 24 nucleotides, at least about 28 to 32 nucleotides, or at least about 36 or more
nucleotides, may also be desired. Similarly, "percent sequence identity" may be readily
determined for amino acid sequences, over the full length of a protein, or a fragment
thereof. Suitably, a fragment is at least about 8 amino acids in length, and may be up to
about 700 amino acids. Examples of suitable fragments are described herein.

Identity is readily determined using such algorithms and computer programs as
are defined herein at default settings. Preferably, such identity is over the full length of
the protein, enzyme, subunit, or over a fragment of at least about 8 amino acids in length.
However, identity may be based upon shorter regions, where suited to the use to which
the identical gene product is being put.

As described herein, alignments are performed using any of a variety of publicly
or commercially available Multiple Sequence Alignment Programs, such as "Clustal W",
accessible through Web Servers on the internet. Alternatively, Vector NTI utilities are
also used. There are also a number of algorithms known in the art that can be used to
measure nucleotide sequence identity, including those contained in the programs
described above. As another example, polynucleotide sequences can be compared using
Fasta, a program in GCG Version 6.1. Fasta provides alignments and percent sequence
identity of the regions of the best overlap between the query and search sequences. For
instance, percent sequence identity between nucleic acid sequences can be determined
using Fasta with its default parameters (a word size of 6 and the NOPAM factor for the
scoring matrix) as provided in GCG Version 6.1, herein incorporated by reference.
Similar programs are available for performing amino acid alignments. Generally, these
programs are used at default settings, although one of skill in the art can alter these
settings as needed. Alternatively, one of skill in the art can utilize another algorithm or
computer program that provides at least the level of identity or alignment as that provided
by the referenced algorithms and programs.
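
By way of illustration only, the percent identity of two sequences that have already been
aligned (for example, by one of the programs named above) might be computed as sketched
below. The function names, the gap character "-", and the choice to count every aligned
column in which at least one sequence has a residue are assumptions made for this sketch;
the referenced programs may define the denominator differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of two pre-aligned sequences.

    Assumption for this sketch: a column counts toward the denominator
    unless both sequences have a gap there.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # a gap opposite a gap carries no information
        compared += 1
        if x == y:  # a gap opposite a residue is never a match
            matches += 1
    if compared == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / compared


def is_substantially_identical(aligned_a: str, aligned_b: str,
                               threshold: float = 95.0) -> bool:
    """True when identity meets a chosen threshold (95% is the lower bound
    of the 'about 95 to 99%' range recited in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

In this sketch the threshold is a parameter so that a stricter value (e.g., 99%) may be
substituted without changing the comparison itself.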

As used throughout this specification and the claims, the term "comprise" and its
variants, including "comprises" and "comprising", among other variants, are inclusive of other
components, elements, integers, steps and the like. The terms "consists of" and "consisting
of" are exclusive of other components, elements, integers, steps and the like.

I. The Simian Adenovirus Sequences

The invention provides nucleic acid sequences and amino acid sequences of
Pan5, Pan6, Pan7, SV1, SV25 and SV39, which are isolated from the other viral material
with which they are associated in nature.

A. Nucleic Acid Sequences 

The Pan5 nucleic acid sequences of the invention include nucleotides 1 to
36462 of SEQ ID NO: 1. The Pan6 nucleic acid sequences of the invention include
nucleotides 1 to 36604 of SEQ ID NO: 5. The Pan7 nucleic acid sequences of the
invention include nucleotides 1 to 36535 of SEQ ID NO: 9. The SV1 nucleic acid
sequences of the invention include nucleotides 1 to 34264 of SEQ ID NO: 24. The SV25
nucleic acid sequences of the invention include nucleotides 1 to 31044 of SEQ ID NO:
29. The SV39 nucleic acid sequences of the invention include nucleotides 1 to 34115 of
SEQ ID NO: 34. See, Sequence Listing, which is incorporated by reference herein.

The nucleic acid sequences of the invention further encompass the
strand which is complementary to the sequences of SEQ ID NO: 5, 9, 24, 29 and 34,
as well as the RNA and cDNA sequences corresponding to these
sequences and their complementary strands. Further included in this invention
are nucleic acid sequences which are greater than 95 to 98%, and more preferably
about 99 to 99.9%, homologous or identical to the Sequence Listing. Also included in
the nucleic acid sequences of the invention are natural variants and engineered
modifications of the sequences provided in SEQ ID NO: 5, 9, 24, 29 and 34 and their
complementary strands. Such modifications include, for example, labels that are
known in the art, methylation, and substitution of one or more of the naturally
occurring nucleotides with a degenerate nucleotide.
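
As a minimal sketch of the relationship between a listed strand, its complementary strand,
and the corresponding RNA, the following may be considered. The function names are
hypothetical, and the sketch assumes plain A/C/G/T input without degenerate nucleotide
codes.

```python
# Translation table for the four standard bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def to_rna(dna: str) -> str:
    """Return the RNA with the same sequence as the given DNA strand (T -> U)."""
    return dna.replace("T", "U").replace("t", "u")
```

Applying `reverse_complement` twice returns the original strand, which offers a simple
consistency check when working with either strand of a listed sequence.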

The invention further encompasses fragments of the sequences of
Pan5, Pan6, Pan7, SV1, SV25 and SV39, their complementary strands, and cDNA and
RNA complementary thereto. Suitable fragments are at least 15 nucleotides in length,
and encompass functional fragments, i.e., fragments which are of biological interest.
For example, a functional fragment can express a desired adenoviral product or may
be useful in production of recombinant viral vectors. Such fragments include the gene
sequences and fragments listed in the tables below.

The following tables provide the transcript regions and open reading frames
in the simian adenovirus sequences of the invention. For certain genes, the transcripts
and open reading frames (ORFs) are located on the strand complementary to that
presented in SEQ ID NO: 5, 9, 24, 29 and 34. See, e.g., E2b, E4 and E2a. The
calculated molecular weights of the encoded proteins are also shown. Note that the
E1a open reading frames of Pan5 [nt 576-1436 of SEQ ID NO: 1], Pan6 [nt 576 to 1437 of
SEQ ID NO: 5] and Pan7 [nt 576 to 1437 of SEQ ID NO: 9] contain internal splice
sites. These splice sites are noted in the following tables.
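
For illustration, tabulated coordinates of this kind might be applied to a genome sequence
as sketched below. The sketch assumes that coordinates are 1-based and inclusive, that an
entry whose start exceeds its end lies on the complementary strand, and that a spliced
entry is given as several segments to be joined in order. These conventions and the
function names are assumptions of the sketch, not statements about how the tables were
generated.

```python
from typing import Sequence, Tuple

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def extract_region(genome: str, segments: Sequence[Tuple[int, int]]) -> str:
    """Join 1-based, inclusive (start, end) segments of a genome.

    A segment with start > end is read from the complementary strand
    (assumed convention for this sketch).
    """
    parts = []
    for start, end in segments:
        low, high = min(start, end), max(start, end)
        if low < 1 or high > len(genome):
            raise ValueError(f"segment {start}-{end} lies outside the sequence")
        piece = genome[low - 1:high]  # convert to 0-based, end-exclusive slice
        parts.append(piece if start <= end else reverse_complement(piece))
    return "".join(parts)
```

Under these assumptions, a spliced entry such as "576-1143, 1229-1437" would be passed as
`[(576, 1143), (1229, 1437)]`, and a complementary-strand entry such as a start of 10340
and an end of 8451 would be passed as `[(10340, 8451)]`.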
Ad Pan-5 [SEQ ID NO: 1]

Region  Feature           Start (nt)   End (nt)     M.W. (Daltons)
ITR                       1            120          -
E1a     Transcript        478          -            -
        13S               576-664, 1233-1436        28120
        12S               576-1046, 1233-1436       24389
        9S                576-644, 1233-1436        9962
        Transcript        -            1516         -
E1b     Transcript        1552         -            -
        Small T           1599         2171         [illegible]
        Large T           1904         3412         -
        IX                3492         3920         -
        Transcript        -            3959         -
E2b     Transcript        10349        -            -
        PTP               10349        8451         [illegible]
        Polymerase        8448         5083         197237
        IVa2              5604         3980         -
        Transcript        -            3960         -
        28.1 kD           5155         5979         -
        Agnoprotein       7864         8580         [illegible]
L1      Transcript        10849        -            -
        52/55 kD          10851        12025        -
        IIIa              12050        13819        65669
        Transcript        -            13832        -
L2      Transcript        13894        -            -
        Penton            13898        15490        59292
        VII               15494        16078        21478
        V                 16123        17166        39568
        Mu                17189        17422        8524
        Transcript        -            17442        -
L3      Transcript        17488        -            -
        VI                17491        18222        26192
        Hexon             18315        21116        104874
        Endoprotease      20989        21783        28304
        Transcript        -            21811        -
E2a     Transcript        26782        -            -
        DBP               23386        21845        57358
        Transcript        -            21788        -
L4      Transcript        -            -            -
        100 kD            [illegible]  [illegible]  [illegible]
        33 kD homolog     -            [illegible]  [illegible]
        VIII              [illegible]  [illegible]  [illegible]
        Transcript        -            [illegible]  -
E3      Transcript        [illegible]  -            -
        Orf #1            [illegible]  [illegible]  [illegible]
        Orf #2            [illegible]  [illegible]  23040
        Orf #3            27994        28527        -
        Orf #4            28557        29156        [illegible]
        Orf #5            29169        29783        22267
        Orf #6            29798        30673        31458
        Orf #7            30681        30956        10477
        Orf #8            30962        31396        16523
        Orf #9            31389        31796        -
        Transcript        -            31837        -
L5      Transcript        32032        -            -
        Fiber             32035        33372        47670
        Transcript        -            33443        -
E4      Transcript        36135        -            -
        Orf 7             33710        33462        9191
        Orf 6             34615        33710        35005
        Orf 4             34886        34521        13878
        Orf 3             35249        34896        13641
        Orf 2             35635        35246        14584
        Orf 1             36050        35676        13772
        Transcript        -            33437        -
ITR                       36343        36462        -
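
The tabulated coordinates and molecular weights can be cross-checked roughly, as sketched
below. The sketch assumes that the listed coordinates of an unspliced open reading frame
include its stop codon and uses an average residue mass of about 110 Daltons, a common
rule of thumb; both are assumptions of this sketch, and the result is only an
approximation of the calculated molecular weights shown in the tables.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average, not an exact value


def orf_residue_count(start: int, end: int) -> int:
    """Number of encoded residues for an unspliced ORF given 1-based,
    inclusive coordinates, assuming the final codon is a stop codon."""
    length_nt = abs(end - start) + 1  # works for either strand orientation
    if length_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return length_nt // 3 - 1  # drop the assumed stop codon


def approximate_mass_da(start: int, end: int) -> float:
    """Rough molecular weight estimate from ORF coordinates."""
    return orf_residue_count(start, end) * AVERAGE_RESIDUE_MASS_DA
```

For example, the Ad Pan-5 fiber entry (nt 32035 to 33372) corresponds under these
assumptions to 445 residues and an estimate of about 48,950 Daltons, which is within a few
percent of the tabulated value of 47670.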
Ad Pan-6 [SEQ ID NO: 5]

Region  Feature           Start (nt)   End (nt)     M.W. (Daltons)
ITR                       1            123          -
E1a     Transcript        478          -            -
        13S               576-1143, 1229-1437       28291
        12S               576-1050, 1229-1437       24634
        9S                576-645, 1229-1437        10102
        Transcript        -            1516         -
E1b     Transcript        1553         -            -
        Small T           1600         2172         22315
        Large T           1905         3413         55594
        IX                3498         3926         14427
        Transcript        -            3965         -
E2b     Transcript        10341        -            -
        PTP               10340        8451         72570
        Polymerase        8445         5089         126907
        IVa2              5610         3986         50452
        Transcript        -            3966         -
L1      Transcript        10838        -            -
        52/55 kD          10840        12012        44205
        IIIa              12036        13799        65460
        Transcript        -            13812        -
        28.1 kD           5161         5985         28012
        Agnoprotein       7870         8580         25382
L2      Transcript        13874        -            -
        Penton            13878        15467        59314
        VII               15471        16055        21508
        V                 16100        17137        39388
        Mu                17160        17393        [illegible]
        Transcript        -            17415        -
L3      Transcript        17466        -            -
        VI                17469        18188        25860
        Hexon             18284        21112        106132
        Endoprotease      21134        21754        23445
        Transcript        -            21803        -
E2a     Transcript        26780        -            -
        DBP               23375        21837        57299
        Transcript        -            21780        -
L4      Transcript        23398        -            -
        100 kD            [illegible]  25806        [illegible]
        33 kD homolog     25523        26357        24609
        VIII              26426        27109        24749
        Transcript        -            27419        -
E3      Transcript        26786        -            -
        Orf #1            27110        27430        12098
        Orf #2            27384        28007        22880
        Orf #3            27989        28519        19460
        Orf #4            28553        29236        25403
        Orf #5            29249        29860        22350
        Orf #6            29875        30741        31028
        Orf #7            30749        31024        10469
        Orf #8            31030        31464        16540
        Orf #9            31457        31864        15264
        Transcript        -            31907        -
L5      Transcript        32159        -            -
        Fiber             32162        33493        47364
        Transcript        -            33574        -
E4      Transcript        36276        -            -
        Orf 7             33841        33593        9177
        Orf 6             34746        33841        35094
        Orf 4             35017        34652        13937
        Orf 3             35380        35027        13627
        Orf 2             35766        35377        14727
        Orf 1             36181        35807        13739
        Transcript        -            33558        -
ITR                       36482        36604        -
Ad Pan-7 [SEQ ID NO: 9]

Region  Feature           Start (nt)   End (nt)     M.W. (Daltons)
ITR                       1            132          -
E1a     Transcript        478          -            -
        13S               576-1143, 1229-1437       28218
        12S               576-1050, 1229-1437       [illegible]
        9S                576-645, 1229-1437        [illegible]
        Transcript        -            1516         -
E1b     Transcript        -            -            -
        Small T           [illegible]  [illegible]  22559
        Large T           [illegible]  [illegible]  55698
        IVa2              [illegible]  [illegible]  50210
        Transcript        -            [illegible]  -
E2b     Transcript        [illegible]  -            -
        PTP               -            [illegible]  72297
        Polymerase        [illegible]  [illegible]  126994
        IX                -            [illegible]  14441
        Transcript        -            3972         -
        28.1 kD           5167         5991         [illegible]
        Agnoprotein       7876         8586         [illegible]
L1      Transcript        10834        -            -
        52/55 kD          10836        12011        44302
        IIIa              12035        13795        65339
        Transcript        -            13808        -
L2      Transcript        13870        -            -
        Penton            13874        15469        59494
        VII               15473        16057        21339
        V                 16102        17139        39414
        Mu                17167        17400        [illegible]
        Transcript        -            17420        -
L3      Transcript        17467        -            -
        VI                17470        18198        26105
        Hexon             18288        21086        104763
        Endoprotease      21106        21732        23620
        Transcript        -            21781        -
E2a     Transcript        26764        -            -
        DBP               23353        21815        57199
        Transcript        -            21755        -
L4      Transcript        23310        -            -
        100 kD            23376        25781        88520
        33 kD homolog     25489        26338        25155
        VIII              26410        27093        24749
        Transcript        -            27403        -
E3      Transcript        26770        -            -
        Orf #1            27094        27414        12056
        Orf #2            27368        27988        22667
        Orf #3            27970        28500        19462
        Orf #4            28530        29150        22999
        Orf #5            29163        29777        22224
        Orf #6            29792        30679        32153
        Orf #7            30687        30962        10511
        Orf #8            30968        31399        16388
        Orf #9            31392        31799        15205
        Transcript        -            31842        -
L5      Transcript        32091        -            -
        Fiber             [illegible]  -            [illegible]
        Transcript        -            -            -
E4      Transcript        36208        -            -
        Orf 7             33784        33536        9191
        Orf 6             34689        33784        35063
        Orf 4             34960        34595        13879
        Orf 3             35323        34970        13641
        Orf 2             35709        35320        14644
        Orf 1             36123        35749        13746
        Transcript        -            33501        -
ITR                       36404        36535        -
          Ad SV-1 [SEQ ID NO: 24]   Ad SV-25 [SEQ ID NO: 29]  Ad SV-39 [SEQ ID NO: 34]
Region    Start        End          Start        End          Start        End
ITR       1            106          1            133          1            150
E1a       352          1120         -            -            404          1409
E1b       1301         2891         359          2273         1518         3877
E2b       9257         2882         9087         2754         10143        3868
E2a       24415        20281        24034        20086        25381        21228
E3        24974        27886        24791        25792        25790        29335
E4        33498        30881        30696        28163        33896        31157
ITR       34145        34264        30912        31044        33966        34115

          Ad SV-1 [SEQ ID NO: 24]   Ad SV-25 [SEQ ID NO: 29]  Ad SV-39 [SEQ ID NO: 34]
Region    Start        End          Start        End          Start        End
ITR       1            106          1            133          1            150
L1        9513         12376        9343         12206        10416        13383
L2        12453        15858        12283        15696        13444        16877
L3        15910        20270        15748        20080        17783        21192
L4        21715        25603        21526        25420        22659        26427
L5        28059        30899        25320        28172        29513        31170
ITR       34145        34264        30912        31044        33966        34115
Ad SV-1 [SEQ ID NO: 24]

Region  Protein           Start        End          M.W.
ITR                       1            106          -
E1a     13S               459          953          18039
        12S               -            -            -
E1b     Small T           -            -            -
        Large T           1301         2413         47791
        IX                2391         2885         [illegible]
E2b     IVa2              -            2994         [illegible]
        Polymerase        6750         [illegible]  [illegible]
        PTP               9957         7371         77411
        Agno-protein      6850         7455         [illegible]
L1      52/55 kD          9515         10647        [illegible]
        IIIa              10663        12377        [illegible]
L2      Penton            19454        13965        [illegible]
        VII               13968        14531        70107
        V                 14588        15695        -
        Mu                15645        15857        7568
L3      VI                15911        16753        30418
        Hexon             16841        19636        104494
        Endoprotease      19645        20262        23407
E2a     DBP               21700        20312        52107
L4      100 kD            21721        24009        85508
        VIII              24591        25292        25390
E3      Orf #1            25292        25609        11950
        Orf #2            25563        26081        18940
        Orf #3            26084        26893        30452
        Orf #4            26908        27180        10232
        Orf #5            27177        17512        12640
        Orf #6            27505        27873        13639
L5      Fiber #2          28059        29150        39472
        Fiber #1          29183        30867        61128
E4      Orf 7             31098        30892        7837
        Orf 6             31982        31122        33921
        Orf 4             32277        31915        14338
        Orf 3             32629        32279        13386
        Orf 2             33018        32626        14753
        Orf 1             33423        33043        14301
ITR                       34145        34264        -
                      Ad SV-25 [SEQ ID NO: 29]               Ad SV-39 [SEQ ID NO: 34]
Region  Protein       Start        End          M.W.         Start        End          M.W.
ITR                   1            133          -            1            150          -
E1a     13S           -            -            -            492          1355         28585
        12S           -            -            -            492          1355         25003
E1b     Small T       478          1030         20274        1518         2075         21652
        Large T       829          2244         52310        1823         3349         55534
        IX            2306         2716         13854        3434         3844         14075
E2b     IVa2          4208         2755         54675        3912         5141         46164
        Polymerase    6581         3858         102839       7753         5033         103988
        PTP           9087         7207         71326        10143        8335         69274
        Agno-protein  6681         7139         16025        -            -            -
L1      52/55 kD      9345         10472        42703        10418        11608        44232
        IIIa          10493        12202        63598        11574        13364        66078
L2      Penton        12284        13801        56949        13448        14959        56292
        VII           13806        14369        20369        14960        15517        20374
        V             14426        15463        39289        15567        16628        39676
        Mu            15483        15695        7598         16650        16871        7497
L3      VI            15749        16591        30347        16925        17695        28043
        Hexon         16681        19446        104035       17785        20538        102579
        Endoprotease  19455        20072        23338        20573        21181        22716
E2a     DBP           21511        20123        52189        22631        21231        53160
L4      100 kD        21532        23829        85970        22659        25355        100362
        VIII          24408        25109        25347        25410        26108        25229
E3      Orf #1        [illegible]  [illegible]  [illegible]  [illegible]  [illegible]  [illegible]
        Orf #2        -            -            -            [illegible]  [illegible]  [illegible]
        Orf #3        -            -            -            [illegible]  [illegible]  [illegible]
        Orf #4        -            -            -            [illegible]  [illegible]  [illegible]
        Orf #5        -            -            -            -            -            -
        Orf #6        -            -            -            -            -            -
L5      Fiber #2      [illegible]  [illegible]  [illegible]  -            -            -
        Fiber #1      -            [illegible]  60707        29515        31116        56382
E4      Orf 7         -            -            -            31441        31118        11856
        Orf 6         29255        28395        33905        32292        31438        33437
        Orf 4         29550        29188        14399        32587        32222        13997
        Orf 3         29902        29552        13284        32954        32607        13353
        Orf 2         30291        29899        14853        33348        32959        14821
        Orf 1         30316        30696        14301        33764        33378        14235
ITR                   30912        31044        -            33966        34115        -





The Pan5, Pan6, Pan7, SV1, SV25 and SV39 adenoviral nucleic acid
sequences are useful as therapeutic agents and in construction of a variety of vector
systems and host cells. As used herein, a vector includes any suitable nucleic acid
molecule including naked DNA, a plasmid, a virus, a cosmid, or an episome. These
sequences and products may be used alone or in combination with other adenoviral
sequences or fragments, or in combination with elements from other adenoviral or
non-adenoviral sequences. The adenoviral sequences of the invention are also useful as
antisense delivery vectors, gene therapy vectors, or vaccine vectors. Thus, the invention
further provides nucleic acid molecules, gene delivery vectors, and host cells which
contain the Ad sequences of the invention.
For example, the invention encompasses a nucleic acid molecule
containing simian Ad ITR sequences of the invention. In another example, the invention
provides a nucleic acid molecule containing simian Ad sequences of the invention
encoding a desired Ad gene product. Still other nucleic acid molecules constructed using
the sequences of the invention will be readily apparent to one of skill in the art, in view of
the information provided herein.

In one embodiment, the simian Ad gene regions identified herein may be
used in a variety of vectors for delivery of a heterologous molecule to a cell. For
example, vectors are generated for expression of an adenoviral capsid protein (or
fragment thereof) for purposes of generating a viral vector in a packaging host cell. Such
vectors may be designed for expression in trans. Alternatively, such vectors are designed
to provide cells which stably contain sequences which express desired adenoviral
functions, e.g., one or more of E1a, E1b, the terminal repeat sequences, E2a, E2b, E4,
E4 ORF6 region.

In addition, the adenoviral gene sequences and fragments thereof are
useful for providing the helper functions necessary for production of helper-dependent
viruses (e.g., adenoviral vectors deleted of essential functions or adeno-associated viruses
(AAV)). For such production methods, the simian adenoviral sequences of the invention
are utilized in a manner similar to those described for the human Ad.
However, due to the differences in sequences between the simian adenoviral sequences of
the invention and those of human Ad, the use of the sequences of the invention essentially
eliminates the possibility of homologous recombination with helper functions in a host cell
carrying human Ad E1 functions, e.g., 293 cells, which may produce infectious
adenoviral contaminants during rAAV production.

Methods of producing rAAV using adenoviral helper functions have been
described at length in the literature with human adenoviral serotypes. See, e.g., US Patent
6,258,595 and the references cited therein. See, also, US Patent 5,871,982; WO 99/14354;
WO 99/15685; WO 99/47691. These methods may also be used in production
of non-human serotype AAV, including non-human primate AAV serotypes. The simian
adenoviral gene sequences of the invention which provide the necessary helper functions
(e.g., E1a, E1b, E2a and/or E4 ORF6) can be particularly useful in providing the
necessary adenoviral function while minimizing or eliminating the possibility of
recombination with any other adenoviruses present in the rAAV-packaging cell which are
typically of human origin. Thus, selected genes or open reading frames of the adenoviral
sequences of the invention may be utilized in these rAAV production methods.

WO 03/046124    PCT/US02/33645

Alternatively, recombinant adenoviral simian vectors of the invention may
be utilized in these methods. Such recombinant adenoviral simian vectors may include,
e.g., a hybrid chimp Ad/AAV in which chimp Ad sequences flank a rAAV expression
cassette composed of, e.g., AAV 3' and/or 5' ITRs and a transgene under the control of
regulatory sequences which control its expression. One of skill in the art will recognize
that still other simian adenoviral vectors and/or gene sequences of the invention will be
useful for production of rAAV and other viruses dependent upon adenoviral helper.

In still another embodiment, nucleic acid molecules are designed for
delivery and expression of selected adenoviral gene products in a host cell to achieve a
desired physiologic effect. For example, a nucleic acid molecule containing sequences
encoding an adenovirus E1a protein of the invention may be delivered to a subject for use
as a cancer therapeutic. Optionally, such a molecule is formulated in a lipid-based carrier
and preferentially targets cancer cells. Such a formulation may be combined with other
cancer therapeutics (e.g., cisplatin, taxol, or the like). Still other uses for the adenoviral
sequences provided herein will be readily apparent to one of skill in the art.

In addition, one of skill in the art will readily understand that the Ad
sequences of the invention can be readily adapted for use in a variety of viral and
non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and
immunogenic molecules. For example, the Pan5, Pan6, Pan7, SV1, SV25 and/or SV39
simian Ad genomes of the invention can be utilized in a variety of rAd and non-rAd
vector systems. Such vector systems may include, e.g., plasmids, lentiviruses,
retroviruses, poxviruses, vaccinia viruses, and adeno-associated viral systems, among
others. Selection of these vector systems is not a limitation of the present invention.

The invention further provides molecules useful for production of the
simian and simian-derived proteins of the invention. Such molecules which carry
polynucleotides including the simian Ad DNA sequences of the invention can be in the
form of naked DNA, a plasmid, a virus or any other genetic element.

B. Simian Adenoviral Proteins of the Invention

The invention further provides gene products of the above adenoviruses,
such as proteins, enzymes, and fragments thereof, which are encoded by the adenoviral
nucleic acids of the invention. The invention further encompasses Pan5, Pan6, Pan7,
SV1, SV25 and SV39 proteins, enzymes, and fragments thereof, having the amino acid
sequences encoded by these nucleic acid sequences which are generated by other
methods. Such proteins include those encoded by the open reading frames identified in
the tables above, in Figs. 1 and 2, and fragments thereof.

Thus, in one aspect, the invention provides unique simian adenoviral
proteins which are substantially pure, i.e., are free of other viral and proteinaceous
proteins. Preferably, these proteins are at least 10% homogeneous, more preferably 60%
homogeneous, and most preferably 95% homogeneous.

In one embodiment, the invention provides unique simian-derived capsid
proteins. As used herein, a simian-derived capsid protein includes any adenoviral capsid
protein that contains a Pan5, Pan6, Pan7, SV1, SV25 or SV39 capsid protein or a
fragment thereof, as defined above, including, without limitation, chimeric capsid
proteins, fusion proteins, artificial capsid proteins, synthetic capsid proteins, and
recombinant capsid proteins, without limitation to means of generating these proteins.

Suitably, these simian-derived capsid proteins contain one or more Pan5,
Pan6, Pan7, SV1, SV25 or SV39 regions or fragments thereof (e.g., a hexon, penton, fiber
or fragment thereof) in combination with capsid regions or fragments thereof of different
adenoviral serotypes, or modified simian capsid proteins or fragments, as described
herein. A "modification of a capsid protein associated with altered tropism" as used
herein includes an altered capsid protein, i.e., a penton, hexon or fiber protein region, or
fragment thereof, such as the knob domain of the fiber region, or a polynucleotide
encoding same, such that specificity is altered. The simian-derived capsid may be
constructed with one or more of the simian Ad of the invention or other Ad serotypes
which may be of human or non-human origin. Such Ad may be obtained from a variety of
sources including the ATCC, commercial and academic sources, or the sequences of the
Ad may be obtained from GenBank or other suitable sources.

The amino acid sequences of the simian adenovirus penton proteins of
the invention are provided herein. The AdPan5 penton protein is provided in SEQ ID
NO:2. The AdPan7 penton is provided in SEQ ID NO:6. The AdPan6 penton is
provided in SEQ ID NO:10. The SV1 penton is provided in SEQ ID NO:25. The SV25
penton protein is provided in SEQ ID NO:30. The SV39 penton is provided in SEQ ID
NO:35. Suitably, any of these penton proteins, or unique fragments thereof, may be
utilized for a variety of purposes. Examples of suitable fragments include the penton
having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 amino
acids, based upon the amino acid numbering provided above and in SEQ ID NO:2; SEQ
ID NO:6; SEQ ID NO:25; SEQ ID NO:30, or SEQ ID NO:35. Other suitable fragments
include shorter internal, C-terminal, or N-terminal fragments. Further, the penton protein
may be modified for a variety of purposes known to those of skill in the art.

The invention further provides the amino acid sequences of the hexon
protein of Pan5 [SEQ ID NO:3], Pan6 [SEQ ID NO:7], Pan7 [SEQ ID NO:11], SV1
[SEQ ID NO:26], SV25 [SEQ ID NO:31], and/or SV39 [SEQ ID NO:36]. Suitably, this
hexon protein, or unique fragments thereof, may be utilized for a variety of purposes.
Examples of suitable fragments include the hexon having N-terminal and/or C-terminal
truncations of about 50, 100, 150, 200, 300, 400, or 500 amino acids, based upon the
amino acid numbering provided above and in SEQ ID NO: 3, 7, 11, 26, 31 and 36. Other
suitable fragments include shorter internal, C-terminal, or N-terminal fragments. For
example, one suitable fragment is the loop region (domain) of the hexon protein, designated
DE1 and FG1, or a hypervariable region thereof. Such fragments include the regions
spanning amino acid residues about 125 to 443; about 138 to 441, or smaller fragments,
such as those spanning about residue 138 to residue 163; about 170 to about 176; about
195 to about 203; about 233 to about 246; about 253 to about 264; about 287 to about
297; and about 404 to about 430 of the simian hexon proteins, with reference to SEQ ID
NO: 3, 7, 11, 26, 31 or 36. Other suitable fragments may be readily identified by one of
skill in the art. Further, the hexon protein may be modified for a variety of purposes
known to those of skill in the art. Because the hexon protein is the determinant for
serotype of an adenovirus, such artificial hexon proteins would result in adenoviruses
having artificial serotypes. Other artificial capsid proteins can also be constructed using
the chimp Ad penton sequences and/or fiber sequences of the invention and/or fragments
thereof.

In one example, it may be desirable to generate an adenovirus having an
altered hexon protein utilizing the sequences of a hexon protein of the invention. One
suitable method for altering hexon proteins is described in US Patent 5,922,315, which is
incorporated by reference. In this method, at least one loop region of the adenovirus
hexon is changed with at least one loop region of another adenovirus serotype. Thus, at
least one loop region of such an altered adenovirus hexon protein is a simian Ad hexon
loop region of the invention (e.g., Pan7). In one embodiment, a loop region of the Pan7
hexon protein is replaced by a loop region from another adenovirus serotype. In another
embodiment, the loop region of the Pan7 hexon is used to replace a loop region from
another adenovirus serotype. Suitable adenovirus serotypes may be readily selected from
among human and non-human serotypes, as described herein. Pan7 is selected for
purposes of illustration only; the other simian Ad hexon proteins of the invention may be
similarly altered, or used to alter another Ad hexon. The selection of a suitable serotype
is not a limitation of the present invention. Still other uses for the hexon protein
sequences of the invention will be readily apparent to those of skill in the art.
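
By way of illustration only, the hexon fragments enumerated above (terminal
truncations and the approximate loop and hypervariable spans) might be cut from a
sequence as sketched below. The function names and the placeholder sequence are
assumptions made for this sketch; the placeholder is synthetic and is not any of
SEQ ID NO: 3, 7, 11, 26, 31 or 36, and the spans are "about" values as stated in the text.

```python
# Approximate hexon spans quoted in the text (1-based, inclusive residue numbers).
HEXON_SPANS = [
    (125, 443), (138, 441), (138, 163), (170, 176), (195, 203),
    (233, 246), (253, 264), (287, 297), (404, 430),
]


def span(seq: str, start: int, end: int) -> str:
    """Return residues start..end of seq, using 1-based inclusive numbering."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"span {start}-{end} is outside a sequence of length {len(seq)}")
    return seq[start - 1:end]


def truncate(seq: str, n_term: int = 0, c_term: int = 0) -> str:
    """Remove n_term residues from the N-terminus and c_term from the C-terminus."""
    if n_term < 0 or c_term < 0 or n_term + c_term >= len(seq):
        raise ValueError("truncation would remove the whole sequence")
    return seq[n_term:len(seq) - c_term]


# Hypothetical placeholder standing in for a real hexon sequence (assumption).
ALPHABET = "ACDEFGHIKLMNPQRSTVWY"
placeholder_hexon = "".join(ALPHABET[i % len(ALPHABET)] for i in range(900))

# One fragment per quoted span, keyed by its (start, end) pair.
fragments = {s: span(placeholder_hexon, *s) for s in HEXON_SPANS}
```

The same two helpers apply unchanged to the penton and fiber truncations described
herein, since only the residue numbers differ.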

The invention further encompasses the fiber proteins of the simian
adenoviruses of the invention. The fiber protein of AdPan5 has the amino acid sequence
of SEQ ID NO:4. The fiber protein of AdPan6 has the amino acid sequence of SEQ ID
NO:8. The fiber protein of AdPan7 has the amino acid sequence of SEQ ID NO:12. SV-1
has two fiber proteins; fiber 2 has the amino acid sequence of SEQ ID NO:27 and fiber 1
has the amino acid sequence of SEQ ID NO:28. SV-25 also has two fiber proteins; fiber
2 has the amino acid sequence of SEQ ID NO:32 and fiber 1 has the amino acid sequence
of SEQ ID NO:33. The fiber protein of SV-39 has the amino acid sequence of SEQ ID
NO:37.

Suitably, this fiber protein, or unique fragments thereof, may be utilized
for a variety of purposes. One suitable fragment is the fiber knob, which spans about
amino acids 247 to 425 of SEQ ID NO: 4, 8, 12, 28, 32, 33 and 37. See Fig. 2. Examples
of other suitable fragments include the fiber having N-terminal and/or C-terminal
truncations of about 50, 100, 150, or 200 amino acids, based upon the amino acid
numbering provided above and in SEQ ID NO: 4, 8, 12, 28, 32, 33 and 37. Still other
suitable fragments include internal fragments. Further, the fiber protein may be modified
using a variety of techniques known to those of skill in the art.

The invention further encompasses unique fragments of the proteins of the
invention which are at least 8 amino acids in length. However, fragments of other desired
lengths can be readily utilized. In addition, the invention encompasses such
modifications as may be introduced to enhance yield and/or expression of a Pan5, Pan6,
Pan7, SV1, SV25 or SV39 gene product, e.g., construction of a fusion molecule in which
all or a fragment of the Pan5, Pan6, Pan7, SV1, SV25 or SV39 gene product is fused
(either directly or via a linker) with a fusion partner to enhance such yield and/or
expression. Other suitable modifications include, without limitation, truncation of a
coding region (e.g., a protein or enzyme) to eliminate a pre- or pro-protein ordinarily
cleaved and to provide the mature protein or enzyme and/or mutation of a coding region
to provide a secretable gene product. Still other modifications will be readily apparent
to one of skill in the art. The invention further encompasses proteins having at least
about 95% to 99% identity to the Pan5, Pan6, Pan7, SV1, SV25 or SV39 proteins
provided herein.
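
As a minimal sketch of how the two criteria above could be checked computationally,
the following enumerates fragments of a given length (8 amino acids by default) that
occur in one sequence and in none of a set of comparison sequences, and computes a
simple percent identity. The function names are hypothetical. The identity calculation
assumes the two sequences have already been aligned to equal length with "-" marking
gaps, and it divides by the alignment length; other conventions for the denominator
exist and give different values.

```python
def kmers(seq: str, k: int = 8) -> set:
    """All contiguous fragments of length k in seq (empty if seq is shorter than k)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def unique_fragments(target: str, others: list, k: int = 8) -> list:
    """Fragments of length k found in target but in none of the other sequences."""
    seen = set()
    for other in others:
        seen |= kmers(other, k)
    return sorted(kmers(target, k) - seen)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns with identical, non-gap residues.

    Assumes pre-aligned, equal-length inputs with '-' as the gap character.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("inputs must be non-empty and of equal aligned length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)
```

For instance, `unique_fragments("ABCDEFGHIJ", ["ABCDEFGHXX"])` keeps only the two
8-residue windows that overlap the differing positions.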

As described herein, vectors of the invention containing the adenoviral
capsid proteins of the invention are particularly well suited for use in applications in
which the neutralizing antibodies diminish the effectiveness of other Ad serotype based
vectors, as well as other viral vectors. The rAd vectors of the invention are particularly
advantageous in readministration for repeat gene therapy or for boosting immune
response (vaccine titers).

Under certain circumstances, it may be desirable to use one or more of the
Pan5, Pan6, Pan7, SV1, SV25 and/or SV39 gene products (e.g., a capsid protein or a
fragment thereof) to generate an antibody. The term "an antibody," as used herein, refers
to an immunoglobulin molecule which is able to specifically bind to an epitope. Thus,
the antibodies of the invention bind, preferably specifically and without cross-reactivity,
to a Pan5, Pan6, Pan7, SV1, SV25 or SV39 epitope. The antibodies in the present
invention exist in a variety of forms including, for example, high affinity polyclonal
antibodies, monoclonal antibodies, synthetic antibodies, chimeric antibodies, recombinant
antibodies and humanized antibodies. Such antibodies originate from immunoglobulin
classes IgG, IgM, IgA, IgD and IgE.

Such antibodies may be generated using any of a number of methods
known in the art. Suitable antibodies may be generated by well-known conventional
techniques, e.g., Kohler and Milstein and the many known modifications thereof.
Similarly, desirable high titer antibodies are generated by applying known recombinant
techniques to the monoclonal or polyclonal antibodies developed to these antigens [see,
e.g., PCT Patent Application No. PCT/GB85/00392; British Patent Application
Publication No. GB2188638A; Amit et al., 1986 Science, 233:747-753; Queen et al.,
1989 Proc. Natl. Acad. Sci. USA, 86:10029-10033; PCT Patent Application No.
PCT/WO9007861; and Riechmann et al., Nature, 332:323-327 (1988); Huse et al., 1988a
Science, 246:1275-1281]. Alternatively, antibodies can be produced by manipulating the
complementarity determining regions of animal or human antibodies to the antigen of this
invention. See, e.g., E. Mark and Padlin, "Humanization of Monoclonal Antibodies",
Chapter 4, The Handbook of Experimental Pharmacology, Vol. 113, The Pharmacology
of Monoclonal Antibodies, Springer-Verlag (June, 1994); Harlow et al., 1999, Using
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et
al., 1989, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York; Houston et
al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Bird et al., 1988, Science
242:423-426. Further provided by the present invention are anti-idiotype antibodies
(Ab2) and anti-anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et al.,
"Modulation of anti-tumor immunity by anti-idiotypic antibodies." In Idiotypic Network
and Diseases, ed. by J. Cerny and J. Hiernaux, 1990 J. Am. Soc. Microbiol., Washington
DC: pp. 203-229. These anti-idiotype and anti-anti-idiotype antibodies are produced
using techniques well known to those of skill in the art. These antibodies may be used for
a variety of purposes, including diagnostic and clinical methods and kits.

Under certain circumstances, it may be desirable to introduce a detectable
label or a tag onto a Pan5, Pan6, Pan7, SV1, SV25 or SV39 gene product, antibody or
other construct of the invention. As used herein, a detectable label is a molecule which is
capable, alone or upon interaction with another molecule, of providing a detectable signal.
Most desirably, the label is detectable visually, e.g., by fluorescence, for ready use in
immunohistochemical analyses or immunofluorescent microscopy. For example, suitable
labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), allophycocyanin
(APC), coriphosphine-O (CPO) or tandem dyes, PE-cyanin-5 (PC5), and PE-Texas Red
(ECD). All of these fluorescent dyes are commercially available, and their uses known to
the art. Other useful labels include a colloidal gold label. Still other useful labels include
radioactive compounds or elements. Additionally, labels include a variety of enzyme
systems that operate to reveal a colorimetric signal in an assay, e.g., glucose oxidase
(which uses glucose as a substrate) releases peroxide as a product which in the presence
of peroxidase and a hydrogen donor such as tetramethyl benzidine (TMB) produces an
oxidized TMB that is seen as a blue color. Other examples include horseradish
peroxidase (HRP) or alkaline phosphatase (AP), and hexokinase in conjunction with
glucose-6-phosphate dehydrogenase which reacts with ATP, glucose, and NAD+ to yield,
among other products, NADH that is detected as increased absorbance at 340 nm
wavelength.

Other label systems that are utilized in the methods of this invention are
detectable by other means, e.g., colored latex microparticles [Bangs Laboratories,
Indiana] in which a dye is embedded are used in place of enzymes to form conjugates
with the target sequences and provide a visual signal indicative of the presence of the
resulting complex in applicable assays.

Methods for coupling or associating the label with a desired molecule are
similarly conventional and known to those of skill in the art. Known methods of label
attachment are described [see, for example, Handbook of Fluorescent Probes and
Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc., Eugene, OR,
1996; Pierce Catalog and Handbook, Life Science and Analytical Research Products,
Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus, selection of the label and
coupling methods do not limit this invention.

The sequences, proteins, and fragments of the invention may be produced
by any suitable means, including recombinant production, chemical synthesis, or other
synthetic means. Suitable production techniques are well known to those of skill in the
art. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Press (Cold Spring Harbor, NY). Alternatively, peptides can also be synthesized
by the well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc.,
85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San
Francisco, 1969) pp. 27-62). These and other suitable production methods are within the
knowledge of those of skill in the art and are not a limitation of the present invention.

In addition, one of skill in the art will readily understand that the Ad
sequences of the invention can be readily adapted for use in a variety of viral and
non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and
immunogenic molecules. For example, in one embodiment, the simian Ad capsid
proteins and other simian adenovirus proteins described herein are used for non-viral,
protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic and
immunogenic molecules. In one such embodiment, a protein of the invention is linked,
directly or indirectly, to a molecule for targeting to cells with a receptor for adenoviruses.
Preferably, a capsid protein such as a hexon, penton, fiber or a fragment thereof having a
ligand for a cell surface receptor is selected for such targeting. Suitable molecules for
delivery are selected from among the therapeutic molecules described herein and their
gene products. A variety of linkers, including lipids, polyLys, and the like, may be
utilized. For example, the simian penton protein may be readily utilized for
such a purpose by production of a fusion protein using the simian penton sequences in a
manner analogous to that described in Medina-Kauwe LK, et al., Gene Ther. 2001 May;
8(10):795-803 and Medina-Kauwe LK, et al., Gene Ther. 2001 Dec; 8(23):1753-1761.
Alternatively, the amino acid sequences of simian Ad protein IX may be utilized for
targeting vectors to a cell surface receptor, as described in US Patent Appln
20010047081. Suitable ligands include a CD40 antigen, an RGD-containing or
polylysine-containing sequence, and the like. Still other simian Ad proteins, including,
e.g., the hexon protein and/or the fiber protein, may be used for these and similar
purposes.

Still other adenoviral proteins of the invention may be used alone, or in
combination with other adenoviral proteins, for a variety of purposes which will be readily
apparent to one of skill in the art. In addition, still other uses for the adenoviral proteins
of the invention will be readily apparent to one of skill in the art.

II. Recombinant Adenoviral Vectors

The compositions of this invention include vectors that deliver a heterologous
molecule to cells, either for therapeutic or vaccine purposes. As used herein, a vector
may include any genetic element including, without limitation, naked DNA, a phage,
transposon, cosmid, episome, plasmid, or a virus. Such vectors contain simian
adenovirus DNA of Pan5, Pan6, Pan7, SV1, SV25 and/or SV39 and a minigene. By
"minigene" is meant the combination of a selected heterologous gene and the other
regulatory elements necessary to drive translation, transcription and/or expression of the
gene product in a host cell.

Typically, an adenoviral vector of the invention is designed such that the
minigene is located in a nucleic acid molecule which contains other adenoviral sequences
in the region native to a selected adenoviral gene. The minigene may be inserted into an
existing gene region to disrupt the function of that region, if desired. Alternatively, the
minigene may be inserted into the site of a partially or fully deleted adenoviral gene. For
example, the minigene may be located in a site such as the site of a functional E1
deletion or functional E3 deletion, among others that may be selected. The term
"functionally deleted" or "functional deletion" means that a sufficient amount of the gene
region is removed or otherwise damaged, e.g., by mutation or modification, so that the
gene region is no longer capable of producing functional products of gene expression. If
desired, the entire gene region may be removed. Other suitable sites for gene disruption or
deletion are discussed elsewhere in the application.

For example, for a production vector useful for generation of a recombinant virus,
the vector may contain the minigene and either the 5' end of the adenoviral genome or the
3' end of the adenoviral genome, or both the 5' and 3' ends of the adenoviral genome.
The 5' end of the adenoviral genome contains the 5' cis-elements necessary for packaging
and replication; i.e., the 5' inverted terminal repeat (ITR) sequences (which function as
origins of replication) and the native 5' packaging enhancer domains (that contain
sequences necessary for packaging linear Ad genomes and enhancer elements for the E1
promoter). The 3' end of the adenoviral genome includes the 3' cis-elements (including
the ITRs) necessary for packaging and encapsidation. Suitably, a recombinant adenovirus
contains both 5' and 3' adenoviral cis-elements and the minigene is located between the
5' and 3' adenoviral sequences. Any adenoviral vector of the invention may also contain
additional adenoviral sequences.
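
The arrangement described above, in which the minigene lies between the 5' and 3'
adenoviral cis-elements, can be expressed as a simple coordinate check. This is a
minimal sketch under stated assumptions: the `Element` type, the element names and
the coordinates used in the usage note are hypothetical and are not taken from any
sequence of the invention.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    """A named region of a vector; start is 0-based, end is exclusive."""
    name: str
    start: int
    end: int


def minigene_is_flanked(elements: list) -> bool:
    """True if the minigene lies wholly between the 5' and 3' cis-elements."""
    by_name = {e.name: e for e in elements}
    try:
        five = by_name["5' cis-elements"]
        mini = by_name["minigene"]
        three = by_name["3' cis-elements"]
    except KeyError:
        # A production vector carrying only one end would land here.
        return False
    return five.end <= mini.start and mini.end <= three.start
```

For example, a layout with hypothetical coordinates
`[Element("5' cis-elements", 0, 450), Element("minigene", 450, 3600), Element("3' cis-elements", 3600, 3750)]`
passes the check, while a layout lacking either end does not.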

Suitably, these adenoviral vectors of the invention contain one or more adenoviral
elements derived from an adenoviral genome of the invention. In one embodiment, the
vectors contain adenoviral ITRs from Pan5, Pan6, Pan7, SV1, SV25 or SV39 and
additional adenoviral sequences from the same adenoviral serotype. In another
embodiment, the vectors contain adenoviral sequences that are derived from a different
adenoviral serotype than that which provides the ITRs. As defined herein, a pseudotyped
adenovirus refers to an adenovirus in which the capsid protein of the adenovirus is from a
different serotype than the serotype which provides the ITRs. The selection of the
serotype of the ITRs and the serotype of any other adenoviral sequences present in the
vector is not a limitation of the present invention. A variety of adenovirus strains are
available from the American Type Culture Collection, Manassas, Virginia, or available by
request from a variety of commercial and institutional sources. Further, the sequences of
many such strains are available from a variety of databases including, e.g., PubMed and
GenBank. Homologous adenovirus vectors prepared from other simian or from human
adenoviruses are described in the published literature [see, for example, US Patent No.
5,240,846]. The DNA sequences of a number of adenovirus types are available from
GenBank, including type Ad5 [GenBank Accession No. M73260]. The adenovirus
sequences may be obtained from any known adenovirus serotype, such as serotypes 2, 3,
4, 7, 12 and 40, and further including any of the presently identified human types.
Similarly, adenoviruses known to infect non-human animals (e.g., simians) may also be
employed in the vector constructs of this invention. See, e.g., US Patent No. 6,083,716.

The viral sequences, helper viruses, if needed, and recombinant viral particles,
and other vector components and sequences employed in the construction of the vectors
described herein are obtained as described above. The DNA sequences of the Pan5, Pan6,
Pan7, SV1, SV25 and/or SV39 simian adenovirus sequences of the invention are
employed to construct vectors and cell lines useful in the preparation of such vectors.

Modifications of the nucleic acid sequences forming the vectors of this invention,
including sequence deletions, insertions, and other mutations may be generated using
standard molecular biological techniques and are within the scope of this invention.

A. The "Minigene"

The methods employed for the selection of the transgene, the cloning and 
construction of the "minigene" and its insertion into the viral vector are within the skill in 
the art given the teachings provided herein. 

1. The transgene

The transgene is a nucleic acid sequence, heterologous to the
vector sequences flanking the transgene, which encodes a polypeptide, protein, or other
product, of interest. The nucleic acid coding sequence is operatively linked to regulatory
components in a manner which permits transgene transcription, translation, and/or
expression in a host cell.

The composition of the transgene sequence will depend upon the
use to which the resulting vector will be put. For example, one type of transgene
sequence includes a reporter sequence, which upon expression produces a detectable
signal. Such reporter sequences include, without limitation, DNA sequences encoding
β-lactamase, β-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, green
fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase,
membrane bound proteins including, for example, CD2, CD4, CD8, the influenza
hemagglutinin protein, and others well known in the art, to which high affinity antibodies
directed thereto exist or can be produced by conventional means, and fusion proteins
comprising a membrane bound protein appropriately fused to an antigen tag domain from,
among others, hemagglutinin or Myc. These coding sequences, when associated with
regulatory elements which drive their expression, provide signals detectable by
conventional means, including enzymatic, radiographic, colorimetric, fluorescence or
other spectrographic assays, fluorescence-activated cell sorting assays and immunological
assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay
(RIA) and immunohistochemistry. For example, where the marker sequence is the LacZ
gene, the presence of the vector carrying the signal is detected by assays for
beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector carrying
the signal may be measured visually by color or light production in a luminometer.

However, desirably, the transgene is a non-marker sequence encoding a
product which is useful in biology and medicine, such as proteins, peptides, RNA,
enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, dsRNA,
ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful RNA
sequence is a sequence which extinguishes expression of a targeted nucleic acid sequence
in the treated animal.

The transgene may be used for treatment, e.g., of genetic deficiencies, as a
cancer therapeutic or vaccine, for induction of an immune response, and/or for
prophylactic vaccine purposes. As used herein, induction of an immune response refers
to the ability of a molecule (e.g., a gene product) to induce a T cell and/or a humoral
immune response to the molecule. The invention further includes using multiple
transgenes, e.g., to correct or ameliorate a condition caused by a multi-subunit protein. In
certain situations, a different transgene may be used to encode each subunit of a protein,
or to encode different peptides or proteins. This is desirable when the size of the DNA
encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet-derived
growth factor, or a dystrophin protein. In order for the cell to produce the multi-subunit
protein, a cell is infected with the recombinant virus containing each of the different
subunits. Alternatively, different subunits of a protein may be encoded by the same
transgene. In this case, a single transgene includes the DNA encoding each of the
subunits, with the DNA for each subunit separated by an internal ribosome entry site
(IRES). This is desirable when the size of the DNA encoding each of the subunits is
small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five
kilobases. As an alternative to an IRES, the DNA may be separated by sequences
encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L.
Donnelly, et al., J. Gen. Virol., 78(Pt 1):13-21 (Jan 1997); Furler, S., et al., Gene Ther.,
8(11):864-873 (June 2001); Klump H., et al., Gene Ther., 8(10):811-817 (May 2001).
This 2A peptide is significantly smaller than an IRES, making it well suited for use when
space is a limiting factor. However, the selected transgene may encode any biologically
active product or other product, e.g., a product desirable for study.
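
The size reasoning above (a single IRES-separated transgene when the subunits plus
IRES total less than five kilobases, a smaller 2A separator when space is limiting,
and separate transgenes otherwise) can be sketched as a short calculation. The
separator lengths below are illustrative assumptions only, not values from this
specification, and applying the same five-kilobase figure to the 2A case is likewise
an assumption of the sketch; the function names are hypothetical.

```python
FIVE_KB = 5000          # threshold stated in the text, in base pairs
ASSUMED_IRES_BP = 600   # illustrative IRES length (assumption)
ASSUMED_2A_BP = 60      # illustrative 2A coding length (assumption)


def cassette_size(subunit_bp: list, separator_bp: int) -> int:
    """Total length of the subunit coding DNA plus one separator between each pair."""
    if not subunit_bp:
        raise ValueError("at least one subunit is required")
    return sum(subunit_bp) + separator_bp * (len(subunit_bp) - 1)


def suggest_layout(subunit_bp: list) -> str:
    """Pick a layout by comparing total cassette size with the five-kilobase figure."""
    if cassette_size(subunit_bp, ASSUMED_IRES_BP) < FIVE_KB:
        return "single transgene, IRES-separated"
    if cassette_size(subunit_bp, ASSUMED_2A_BP) < FIVE_KB:
        return "single transgene, 2A-separated"
    return "separate transgenes"
```

Under these assumed lengths, two subunits of 1400 and 700 base pairs fit an
IRES-separated cassette, whereas two subunits of 2300 base pairs each fit only with
the shorter 2A separator.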

Suitable transgenes may be readily selected by one of skill in the art. The
selection of the transgene is not considered to be a limitation of this invention.

2. Regulatory Elements

In addition to the major elements identified above for the
minigene, the vector also includes conventional control elements which are
operably linked to the transgene in a manner that permits its transcription, translation
and/or expression in a cell transfected with the plasmid vector or infected with the virus
produced by the invention. As used herein, "operably linked" sequences include both
expression control sequences that are contiguous with the gene of interest and expression
control sequences that act in trans or at a distance to control the gene of interest.

Expression control sequences include appropriate transcription
initiation, termination, promoter and enhancer sequences; efficient RNA processing
signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus
sequence); sequences that enhance protein stability; and when desired, sequences that
enhance secretion of the encoded product. A great number of expression control
sequences, including promoters which are native, constitutive, inducible and/or
tissue-specific, are known in the art and may be utilized.

Examples of constitutive promoters include, without limitation,
the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV
enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer)
[see, e.g., Boshart et al., Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate
reductase promoter, the β-actin promoter, the phosphoglycerate kinase (PGK) promoter,
and the EF1α promoter [Invitrogen].

Inducible promoters allow regulation of gene expression and can
be regulated by exogenously supplied compounds, environmental factors such as
temperature, or the presence of a specific physiological state, e.g., acute phase, a
particular differentiation state of the cell, or in replicating cells only. Inducible
promoters and inducible systems are available from a variety of commercial sources,
including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have
been described and can be readily selected by one of skill in the art. For example,
inducible promoters include the zinc-inducible sheep metallothionein (MT) promoter and
the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter.
Other inducible systems include the T7 polymerase promoter system [WO 98/10088]; the
ecdysone insect promoter [No et al., Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)],
the tetracycline-repressible system [Gossen et al., Proc. Natl. Acad. Sci. USA, 89:5547-
5551 (1992)], the tetracycline-inducible system [Gossen et al., Science, 268:1766-1769
(1995), see also Harvey et al., Curr. Opin. Chem. Biol., 2:512-518 (1998)]. Other systems
include the FK506 dimer, VP16 or p65 using castradiol, diphenol murislerone, the
RU486-inducible system [Wang et al., Nat. Biotech., 15:239-243 (1997) and Wang et al.,
Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al., J. Clin.
Invest., 100:2865-2872 (1997)]. The effectiveness of some inducible promoters
increases over time. In such cases one can enhance the effectiveness of such systems by
inserting multiple repressors in tandem, e.g., TetR linked to a TetR by an IRES.
Alternatively, one can wait at least 3 days before screening for the desired function. One
can enhance expression of desired proteins by known means to enhance the effectiveness
of this system, for example, by using the Woodchuck Hepatitis Virus Posttranscriptional
Regulatory Element (WPRE).

In another embodiment, the native promoter for the transgene will
be used. The native promoter may be preferred when it is desired that expression of the
transgene should mimic the native expression. The native promoter may be used when
expression of the transgene must be regulated temporally or developmentally, or in a
tissue-specific manner, or in response to specific transcriptional stimuli. In a further
embodiment, other native expression control elements, such as enhancer elements,
polyadenylation sites or Kozak consensus sequences may also be used to mimic the
native expression.

Another embodiment of the transgene includes a transgene
operably linked to a tissue-specific promoter. For instance, if expression in skeletal
muscle is desired, a promoter active in muscle should be used. These include the
promoters from genes encoding skeletal β-actin, myosin light chain 2A, dystrophin,
muscle creatine kinase, as well as synthetic muscle promoters with activities higher than
naturally occurring promoters (see Li et al, Nat. Biotech., 17:241-245 (1999)). Examples
of promoters that are tissue-specific are known for liver (albumin, Miyatake et al, J.
Virol., 71:5124-32 (1997); hepatitis B virus core promoter, Sandig et al, Gene Ther.,
3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot et al, Hum. Gene Ther., 7:1503-14
(1996)), bone osteocalcin (Stein et al, Mol. Biol. Rep., 24:185-96 (1997)); bone
sialoprotein (Chen et al, J. Bone Miner. Res., 11:654-64 (1996)), lymphocytes (CD2,
Hansal et al, J. Immunol., 161:1063-8 (1998); immunoglobulin heavy chain; T cell
receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen et
al, Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et
al, Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene
(Piccioli et al, Neuron, 15:373-84 (1995)), among others.

Optionally, vectors carrying transgenes encoding therapeutically useful or
immunogenic products may also include selectable markers or reporter genes, which may
include sequences encoding geneticin, hygromycin or puromycin resistance, among others.
Such selectable reporters or marker genes (preferably located outside the viral genome to
be packaged into a viral particle) can be used to signal the presence of the plasmids in
bacterial cells, such as ampicillin resistance. Other components of the vector may include
an origin of replication. Selection of these and other promoters and vector elements is
conventional and many such sequences are available [see, e.g., Sambrook et al, and
references cited therein].

These vectors are generated using the techniques and sequences provided
herein, in conjunction with techniques known to those of skill in the art.

Such techniques include conventional cloning techniques of cDNA such as those
described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping oligonucleotide
sequences of the adenovirus genomes, polymerase chain reaction, and any suitable
method which provides the desired nucleotide sequence.

III. Production of the Recombinant Viral Particle 

In one embodiment, the simian adenoviral plasmids (or other vectors) are used to
produce recombinant adenoviral particles. In one embodiment, the recombinant
adenoviruses are functionally deleted in the E1a or E1b genes, and optionally bear
other mutations, e.g., temperature-sensitive mutations or deletions in other genes. In
other embodiments, it is desirable to retain an intact E1a and/or E1b region in the
recombinant adenoviruses. Such an intact E1 region may be located in its native location
in the adenoviral genome or placed in the site of a deletion in the native adenoviral
genome (e.g., in the E3 region).

In the construction of useful simian adenovirus vectors for delivery of a gene to
the human (or other mammalian) cell, a range of adenovirus nucleic acid sequences can
be employed in the vectors. For example, all or a portion of the adenovirus delayed early
gene E3 may be eliminated from the simian adenovirus sequence which forms a part of
the recombinant virus. The function of simian E3 is believed to be irrelevant to the
function and production of the recombinant virus particle. Simian adenovirus vectors
may also be constructed having a deletion of at least the ORF6 region of the E4 gene, and
more desirably because of the redundancy in the function of this region, the entire E4
region. Still another vector of this invention contains a deletion in the delayed early gene
E2a. Deletions may also be made in any of the late genes L1 through L5 of the simian
adenovirus genome. Similarly, deletions in the intermediate genes IX and IVa2 may be
useful for some purposes. Other deletions may be made in the other structural or
non-structural adenovirus genes. The above discussed deletions may be used individually,
i.e., an adenovirus sequence for use in the present invention may contain deletions in only
a single region. Alternatively, deletions of entire genes or portions thereof effective to
destroy their biological activity may be used in any combination. For example, in one
exemplary vector, the adenovirus sequence may have deletions of the E1 genes and the
E4 gene, or of the E1, E2a and E3 genes, or of the E1 and E3 genes, or of E1, E2a and E4
genes, with or without deletion of E3, and so on. As discussed above, such deletions may
be used in combination with other mutations, such as temperature-sensitive mutations, to
achieve a desired result.

An adenoviral vector lacking any essential adenoviral sequences (e.g., E1a, E1b,
E2a, E2b, E4 ORF6, L1, L2, L3, L4 and L5) may be cultured in the presence of the
missing adenoviral gene products which are required for viral infectivity and propagation
of an adenoviral particle. These helper functions may be provided by culturing the
adenoviral vector in the presence of one or more helper constructs (e.g., a plasmid or
virus) or a packaging host cell. See, for example, the techniques described for preparation
of a "minimal" human Ad vector in International Patent Application WO96/13597,
published May 9, 1996, and incorporated herein by reference.
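
By way of illustration only, the complementation logic described above can be sketched as a set difference: the functions that must still be supplied in trans are the essential sequences deleted from the vector, less those expressed by the packaging cell. The function name, the variable names and the gene labels as strings are hypothetical and are introduced solely for this sketch; the list of essential sequences is the exemplary list given above, not an exhaustive one.

```python
# Illustrative sketch only. Names are hypothetical; the essential-gene list
# is the exemplary list from the text (E1a, E1b, E2a, E2b, E4 ORF6, L1-L5).
ESSENTIAL = {"E1a", "E1b", "E2a", "E2b", "E4 ORF6", "L1", "L2", "L3", "L4", "L5"}


def functions_needed_in_trans(vector_deletions, cell_line_expresses):
    """Return essential functions that a helper construct must still supply.

    vector_deletions: gene regions deleted from the adenoviral vector.
    cell_line_expresses: gene products expressed by the packaging cell.
    """
    # Only deletions of essential sequences require complementation.
    missing_essential = set(vector_deletions) & ESSENTIAL
    # Whatever the packaging cell already provides need not come from a helper.
    return missing_essential - set(cell_line_expresses)


# An E1/E3-deleted vector grown in an E1-expressing cell line needs no helper:
# E3 is treated as non-essential and E1 is complemented by the cell.
print(sorted(functions_needed_in_trans({"E1a", "E1b", "E3"}, {"E1a", "E1b"})))

# An E1/E4-deleted vector in the same cell line still needs E4 ORF6 in trans.
print(sorted(functions_needed_in_trans({"E1a", "E1b", "E4 ORF6"}, {"E1a", "E1b"})))
```

A non-empty result in this sketch corresponds to the case in which a helper virus or other helper construct would be used in addition to the packaging cell.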
1. Helper Viruses

Thus, depending upon the simian adenovirus gene content of the
viral vectors employed to carry the minigene, a helper adenovirus or non-replicating virus
fragment may be necessary to provide sufficient simian adenovirus gene sequences
necessary to produce an infective recombinant viral particle containing the minigene.
Useful helper viruses contain selected adenovirus gene sequences not present in the
adenovirus vector construct and/or not expressed by the packaging cell line in which the
vector is transfected. In one embodiment, the helper virus is replication-defective and
contains a variety of adenovirus genes in addition to the sequences described above.
Such a helper virus is desirably used in combination with an E1-expressing cell line.

Helper viruses may also be formed into poly-cation conjugates as
described in Wu et al, J. Biol. Chem., 264:16985-16987 (1989); K. J. Fisher and J. M.
Wilson, Biochem. J., 299:49 (April 1, 1994). Helper virus may optionally contain a
second reporter minigene. A number of such reporter genes are known to the art. The
presence of a reporter gene on the helper virus which is different from the transgene on
the adenovirus vector allows both the Ad vector and the helper virus to be independently
monitored. This second reporter is used to enable separation between the resulting
recombinant virus and the helper virus upon purification.

2. Complementation Cell Lines 
To generate recombinant simian adenoviruses (Ad) deleted in any
of the genes described above, the function of the deleted gene region, if essential to the
replication and infectivity of the virus, must be supplied to the recombinant virus by a
helper virus or cell line, i.e., a complementation or packaging cell line. In many
circumstances, a cell line expressing the human E1 can be used to transcomplement the
chimp Ad vector. This is particularly advantageous because, due to the diversity between
the chimp Ad sequences of the invention and the human AdE1 sequences found in
currently available packaging cells, the use of the current human E1-containing cells
prevents the generation of replication-competent adenoviruses during the replication and
production process. However, in certain circumstances, it will be desirable to utilize a
cell line which expresses the E1 gene products for production of an E1-deleted
simian adenovirus. Such cell lines have been described. See, e.g., US Patent
6,083,716.

If desired, one may utilize the sequences provided herein to
generate a packaging cell or cell line that expresses, at a minimum, the adenovirus E1
gene from Pan5, Pan6, Pan7, SV1, SV25 or SV39 under the transcriptional control of a
promoter for expression in a selected parent cell line. Inducible or constitutive promoters
may be employed for this purpose. Examples of such promoters are described in detail
elsewhere in this specification. A parent cell is selected for the generation of a novel cell
line expressing any desired AdPan5, Pan6, Pan7, SV1, SV25 or SV39 gene. Without
limitation, such a parent cell line may be HeLa [ATCC Accession No. CCL 2], A549
[ATCC Accession No. CCL 185], HEK 293, KB [CCL 17], Detroit [e.g., Detroit 510,
CCL 72] and WI-38 [CCL 75] cells, among others. These cell lines are all available from
the American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia
20110-2209. Other suitable parent cell lines may be obtained from other sources.

Such E1-expressing cell lines are useful in the generation of
recombinant simian adenovirus E1-deleted vectors. Additionally, or alternatively, the
invention provides cell lines that express one or more simian adenoviral gene products,
e.g., E1a, E1b, E2a, and/or E4 ORF6, which can be constructed using essentially the same
procedures, for use in the generation of recombinant simian viral vectors. Such cell lines
can be utilized to transcomplement adenovirus vectors deleted in the essential genes that
encode those products, or to provide helper functions necessary for packaging of a
helper-dependent virus (e.g., adeno-associated virus). The preparation of a host cell according to
this invention involves techniques such as assembly of selected DNA sequences. This
assembly may be accomplished utilizing conventional techniques. Such techniques
include cDNA and genomic cloning, which are well known and are described in
Sambrook et al., cited above, use of overlapping oligonucleotide sequences of the
adenovirus genomes, combined with polymerase chain reaction, synthetic methods, and
any other suitable methods which provide the desired nucleotide sequence.

In still another alternative, the essential adenoviral gene products
are provided in trans by the adenoviral vector and/or helper virus. In such an instance, a
suitable host cell can be selected from any biological organism, including prokaryotic
(e.g., bacterial) cells, and eukaryotic cells, including insect cells, yeast cells and
mammalian cells. Particularly desirable host cells are selected from among any
mammalian species, including, without limitation, cells such as A549, WEHI, 3T3,
10T1/2, HEK 293 cells or PER.C6 (both of which express functional adenoviral E1)
[Fallaux, FJ et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12, L cells, HT1080,
HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals
including human, monkey, mouse, rat, rabbit, and hamster. The selection of the
mammalian species providing the cells is not a limitation of this invention; nor is the type
of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc.

3. Assembly of Viral Particle and Transfection of a Cell Line

Generally, when delivering the vector comprising the minigene by
transfection, the vector is delivered in an amount from about 5 µg to about 100 µg DNA,
and preferably about 10 to about 50 µg DNA, to about 1 x 10⁴ cells to about 1 x 10¹³ cells,
and preferably about 10⁵ cells. However, the relative amounts of vector DNA to host
cells may be adjusted, taking into consideration such factors as the selected vector, the
delivery method and the host cells selected.
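
As a purely illustrative aid, the relative amount of vector DNA to host cells may be expressed as mass of DNA per cell and checked against the general ranges stated above (about 5 µg to about 100 µg DNA; about 1 x 10⁴ to about 1 x 10¹³ cells). The function names below are hypothetical, and treating the "about" limits as hard bounds is a simplifying assumption of this sketch.

```python
# Illustrative sketch only. Names are hypothetical, and the "about" limits
# from the text are treated as hard bounds purely for simplicity.
DNA_RANGE_UG = (5.0, 100.0)   # general range of vector DNA, in micrograms
CELL_RANGE = (1e4, 1e13)      # general range of host cell number


def dna_per_cell_pg(dna_ug, n_cells):
    """Return picograms of vector DNA per host cell (1 microgram = 1e6 pg)."""
    if n_cells <= 0:
        raise ValueError("n_cells must be positive")
    return dna_ug * 1e6 / n_cells


def within_general_ranges(dna_ug, n_cells):
    """Check a planned transfection against the general ranges given above."""
    dna_ok = DNA_RANGE_UG[0] <= dna_ug <= DNA_RANGE_UG[1]
    cells_ok = CELL_RANGE[0] <= n_cells <= CELL_RANGE[1]
    return dna_ok and cells_ok


# 10 micrograms of vector DNA delivered to about 1e5 cells.
print(dna_per_cell_pg(10, 1e5), within_general_ranges(10, 1e5))
```

Such a ratio is only a bookkeeping convenience; as stated above, the amounts may be adjusted for the selected vector, delivery method and host cells.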

The vector may be any vector known in the art or disclosed above,
including naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc.
Introduction into the host cell of the vector may be achieved by any means known in the
art or as disclosed above, including transfection, and infection. One or more of the
adenoviral genes may be stably integrated into the genome of the host cell, stably
expressed as episomes, or expressed transiently. The gene products may all be expressed
transiently, on an episome or stably integrated, or some of the gene products may be
expressed stably while others are expressed transiently. Furthermore, the promoters for
each of the adenoviral genes may be selected independently from a constitutive promoter,
an inducible promoter or a native adenoviral promoter. The promoters may be regulated
by a specific physiological state of the organism or cell (i.e., by the differentiation state or
in replicating or quiescent cells) or by exogenously-added factors, for example.

Introduction of the molecules (as plasmids or viruses) into the host
cell may also be accomplished using techniques known to the skilled artisan and as
discussed throughout the specification. In a preferred embodiment, standard transfection
techniques are used, e.g., CaPO₄ transfection or electroporation.

Assembly of the selected DNA sequences of the adenovirus (as
well as the transgene and other vector elements) into various intermediate plasmids, and
the use of the plasmids and vectors to produce a recombinant viral particle are all
achieved using conventional techniques. Such techniques include conventional cloning
techniques of cDNA such as those described in texts [Sambrook et al, cited above], use of
overlapping oligonucleotide sequences of the adenovirus genomes, polymerase chain
reaction, and any suitable method which provides the desired nucleotide sequence.
Standard transfection and co-transfection techniques are employed, e.g., CaPO₄
precipitation techniques. Other conventional methods employed include homologous
recombination of the viral genomes, plaquing of viruses in agar overlay, methods of
measuring signal generation, and the like.

For example, following the construction and assembly of the desired
minigene-containing viral vector, the vector is transfected in vitro in the presence of a
helper virus into the packaging cell line. Homologous recombination occurs between the
helper and the vector sequences, which permits the adenovirus-transgene sequences in the
vector to be replicated and packaged into virion capsids, resulting in the recombinant viral
vector particles. The current method for producing such virus particles is
transfection-based. However, the invention is not limited to such methods.

The resulting recombinant simian adenoviruses are useful in transferring a
selected transgene to a selected cell. In in vivo experiments with the recombinant virus
grown in the packaging cell lines, the E1-deleted recombinant simian adenoviral vectors
of the invention demonstrate utility in transferring a transgene to a non-simian, preferably
a human, cell.

IV. Use of the Recombinant Adenovirus Vectors 

The recombinant simian adenovirus vectors of the invention are useful for gene 
transfer to a human or non-simian veterinary patient in vitro, ex vivo, and in vivo. 

The recombinant adenovirus vectors described herein can be used as expression
vectors for the production of the products encoded by the heterologous genes in vitro.
For example, the recombinant adenoviruses containing a gene inserted into the location of
an E1 deletion may be transfected into an E1-expressing cell line as described above.
Alternatively, replication-competent adenoviruses may be used in another selected cell
line. The transfected cells are then cultured in the conventional manner, allowing the
recombinant adenovirus to express the gene product from the promoter. The gene
product may then be recovered from the culture medium by known conventional methods
of protein isolation and recovery from culture.

A Pan5, Pan6, Pan7, SV1, SV25 or SV39-derived recombinant simian adenoviral
vector of the invention provides an efficient gene transfer vehicle that can deliver a
selected transgene to a selected host cell in vivo or ex vivo even where the organism has
neutralizing antibodies to one or more adenovirus serotypes. In one embodiment, the rAd
and the cells are mixed ex vivo; the infected cells are cultured using conventional
methodologies; and the transduced cells are re-infused into the patient. These
compositions are particularly well suited to gene delivery for therapeutic purposes and for
immunization, including inducing protective immunity.

More commonly, the Pan5, Pan6, Pan7, SV1, SV25, or SV39 recombinant
adenoviral vectors of the invention will be utilized for delivery of therapeutic or
immunogenic molecules, as described below. It will be readily understood for both
applications, that the recombinant adenoviral vectors of the invention are particularly well
suited for use in regimens involving repeat delivery of recombinant adenoviral vectors.
Such regimens typically involve delivery of a series of viral vectors in which the viral
capsids are alternated. The viral capsids may be changed for each subsequent
administration, or after a pre-selected number of administrations of a particular serotype
capsid (e.g., one, two, three, four or more). Thus, a regimen may involve delivery of a
rAd with a first simian capsid, delivery of a rAd with a second simian capsid, and
delivery of a rAd with a third simian capsid. A variety of other regimens which use the Ad capsids
of the invention alone, in combination with one another, or in combination with other Ad
serotypes will be apparent to those of skill in the art. Optionally, such a regimen may
involve administration of rAd with capsids of other non-human primate adenoviruses,
human adenoviruses, or artificial serotypes such as are described herein. Each phase of
the regimen may involve administration of a series of injections (or other delivery routes)
with a single Ad serotype capsid followed by a series with another Ad serotype capsid.
Alternatively, the recombinant Ad vectors of the invention may be utilized in regimens
involving other non-adenoviral-mediated delivery systems, including other viral systems,
non-viral delivery systems, protein, peptides, and other biologically active molecules.
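
For illustration only, a regimen in which the capsid is changed after a pre-selected number of administrations can be laid out as a simple schedule. The function name and parameters below are hypothetical, and cycling back to the first capsid once the list is exhausted is an assumption of this sketch rather than a requirement of any regimen described herein.

```python
# Illustrative sketch only. The function name and parameters are hypothetical;
# wrapping around to the first capsid is an assumption made for this example.
def capsid_schedule(serotypes, n_administrations, doses_per_capsid=1):
    """Return the capsid serotype used at each administration, in order.

    serotypes: ordered capsid serotypes available to the regimen.
    n_administrations: total number of administrations planned.
    doses_per_capsid: administrations given before switching capsid.
    """
    if not serotypes:
        raise ValueError("at least one serotype is required")
    if doses_per_capsid < 1:
        raise ValueError("doses_per_capsid must be at least 1")
    return [
        serotypes[(i // doses_per_capsid) % len(serotypes)]
        for i in range(n_administrations)
    ]


# Capsid changed at every administration: first, second, then third capsid.
print(capsid_schedule(["Pan5", "Pan6", "Pan7"], 3))

# Capsid changed after every two administrations.
print(capsid_schedule(["Pan5", "SV1"], 5, doses_per_capsid=2))
```

The serotype labels in the list could equally include other non-human primate or human adenoviral serotypes, consistent with the combinations described above.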

The following sections will focus on exemplary molecules which may be
delivered via the adenoviral vectors of the invention.

A. Ad-Mediated Delivery of Therapeutic Molecules 

In one embodiment, the above-described recombinant vectors are
administered to humans according to published methods for gene therapy. A simian viral
vector bearing the selected transgene may be administered to a patient, preferably
suspended in a biologically compatible solution or pharmaceutically acceptable delivery
vehicle. A suitable vehicle includes sterile saline. Other aqueous and non-aqueous
isotonic sterile injection solutions and aqueous and non-aqueous sterile suspensions
known to be pharmaceutically acceptable carriers and well known to those of skill in the
art may be employed for this purpose.

The simian adenoviral vectors are administered in sufficient amounts to
transduce the target cells and to provide sufficient levels of gene transfer and expression
to provide a therapeutic benefit without undue adverse effects or with medically acceptable
physiological effects, which can be determined by those skilled in the medical arts.
Conventional and pharmaceutically acceptable routes of administration include, but are
not limited to, direct delivery to the retina and other intraocular delivery methods, direct
delivery to the liver, inhalation, intranasal, intravenous, intramuscular, intratracheal,
subcutaneous, intradermal, rectal, oral and other parenteral routes of administration.
Routes of administration may be combined, if desired, or adjusted depending upon the
transgene or the condition. The route of administration primarily will depend on the
nature of the condition being treated.

Dosages of the viral vector will depend primarily on factors such as the
condition being treated, the age, weight and health of the patient, and may thus vary
among patients. For example, a therapeutically effective adult human or veterinary
dosage of the viral vector is generally in the range of from about 100 µL to about 100 mL
of a carrier containing concentrations of from about 1 x 10⁶ to about 1 x 10¹⁵ particles,
about 1 x 10¹¹ to 1 x 10¹³ particles, or about 1 x 10⁹ to 1 x 10¹² particles of virus. Dosages
will range depending upon the size of the animal and the route of administration. For
example, a suitable human or veterinary dosage (for about an 80 kg animal) for
intramuscular injection is in the range of about 1 x 10⁹ to about 5 x 10¹² particles per mL,
for a single site. Optionally, multiple sites of administration may be used. In another
example, a suitable human or veterinary dosage may be in the range of about 1 x 10¹¹ to
about 1 x 10¹⁵ particles for an oral formulation. One of skill in the art may adjust these
doses, depending on the route of administration, and the therapeutic or vaccinal application
for which the recombinant vector is employed. The levels of expression of the transgene,
or for an immunogen, the level of circulating antibody, can be monitored to determine the
frequency of dosage administration. Yet other methods for determining the timing or
frequency of administration will be readily apparent to one of skill in the art.
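
The arithmetic relating the quantities above may be sketched as follows: the total number of particles delivered is the particle concentration multiplied by the volume given per site and by the number of sites. The function names are hypothetical, the intramuscular range is the per-mL, single-site range stated above (about 1 x 10⁹ to about 5 x 10¹² particles per mL), and treating the "about" limits as hard bounds is a simplifying assumption. This sketch is illustrative only and is not a dosing recommendation.

```python
# Illustrative sketch only; not a dosing recommendation. Names are
# hypothetical, and the "about" limits are treated as hard bounds.
IM_CONC_RANGE = (1e9, 5e12)  # particles per mL, single intramuscular site


def total_particles(conc_per_ml, volume_ml, n_sites=1):
    """Total particles = concentration x volume per site x number of sites."""
    if conc_per_ml <= 0 or volume_ml <= 0 or n_sites < 1:
        raise ValueError("concentration, volume and site count must be positive")
    return conc_per_ml * volume_ml * n_sites


def im_concentration_in_range(conc_per_ml):
    """Check a concentration against the stated intramuscular range."""
    low, high = IM_CONC_RANGE
    return low <= conc_per_ml <= high


# 0.5 mL at 1e11 particles per mL, delivered to each of two sites.
dose = total_particles(1e11, 0.5, n_sites=2)
print(f"{dose:.1e} particles; in range: {im_concentration_in_range(1e11)}")
```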

An optional method step involves the co-administration to the patient,
either concurrently with, or before or after administration of the viral vector, of a suitable
amount of a short acting immune modulator. The selected immune modulator is defined
herein as an agent capable of inhibiting the formation of neutralizing antibodies directed
against the recombinant vector of this invention or capable of inhibiting cytolytic T
lymphocyte (CTL) elimination of the vector. The immune modulator may interfere with
the interactions between the T helper subsets (TH1 or TH2) and B cells to inhibit
neutralizing antibody formation. Alternatively, the immune modulator may inhibit the
interaction between TH1 cells and CTLs to reduce the occurrence of CTL elimination of
the vector. A variety of useful immune modulators and dosages for use of same are
disclosed, for example, in Yang et al, J. Virol., 70(9) (Sept., 1996); International Patent
Application No. WO96/12406, published May 2, 1996; and International Patent
Application No. PCT/US96/03035, all incorporated herein by reference.

1. Therapeutic Transgenes

Useful therapeutic products encoded by the transgene include
hormones and growth and differentiation factors including, without limitation, insulin,
glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing
factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human
chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins,
angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO),
connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic
fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth
factor α (TGFα), platelet-derived growth factor (PDGF), insulin growth factors I and II
(IGF-I and IGF-II), any one of the transforming growth factor β superfamily, including
TGFβ, activins, inhibins, or any of the bone morphogenic proteins (BMP) BMPs 1-15, any
one of the heregulin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth
factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF),
neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line
derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of
semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins,
noggin, sonic hedgehog and tyrosine hydroxylase.

Other useful transgene products include proteins that regulate the
immune system including, without limitation, cytokines and lymphokines such as
thrombopoietin (TPO), interleukins (IL) IL-1 through IL-25 (including, e.g., IL-2, IL-4,
IL-12 and IL-18), monocyte chemoattractant protein, leukemia inhibitory factor,
granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors
α and β, interferons α, β, and γ, stem cell factor, flk-2/flt3 ligand. Gene products produced by the
immune system are also useful in the invention. These include, without limitation,
immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized
antibodies, single chain antibodies, T cell receptors, chimeric T cell receptors, single
chain T cell receptors, class I and class II MHC molecules, as well as engineered
immunoglobulins and MHC molecules. Useful gene products also include complement
regulatory proteins such as membrane cofactor protein
(MCP), decay accelerating factor (DAF), CR1, CR2 and CD59.

Still other useful gene products include any one of the receptors
for the hormones, growth factors, cytokines, lymphokines, regulatory proteins and
immune system proteins. The invention encompasses receptors for cholesterol regulation,
including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL)
receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor.
The invention also encompasses gene products such as members of the steroid hormone
receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin
D receptors and other nuclear receptors. In addition, useful gene products include
transcription factors such as jun, fos, max, mad, serum response factor (SRF), AP-1, AP2,
myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2,
ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins,
interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT,
GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix
proteins.

Other useful gene products include carbamoyl synthetase I,
ornithine transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase,
fumarylacetoacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin,
glucose-6-phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione
beta-synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-CoA
dehydrogenase, propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA
dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase,
phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis
transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence.

Other useful gene products include non-naturally occurring
polypeptides, such as chimeric or hybrid polypeptides having a non-naturally occurring
amino acid sequence containing insertions, deletions or amino acid substitutions. For
example, single-chain engineered immunoglobulins could be useful in certain
immunocompromised patients. Other types of non-naturally occurring gene sequences
include antisense molecules and catalytic nucleic acids, such as ribozymes, which could
be used to reduce overexpression of a target.

Reduction and/or modulation of expression of a gene are
particularly desirable for treatment of hyperproliferative conditions characterized by
hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those
polypeptides which are produced exclusively or at higher levels in hyperproliferative cells
as compared to normal cells. Target antigens include polypeptides encoded by oncogenes
such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trk and
EGRF. In addition to oncogene products as target antigens, target polypeptides for
anti-cancer treatments and protective regimens include variable regions of antibodies
made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas
which, in some embodiments, are also used as target antigens for autoimmune disease.
Other tumor-associated polypeptides can be used as target polypeptides such as
polypeptides which are found at higher levels in tumor cells including the polypeptide
recognized by monoclonal antibody 17-1A and folate binding polypeptides.

Other suitable therapeutic polypeptides and proteins include those
which may be useful for treating individuals suffering from autoimmune diseases and
disorders by conferring a broad based protective immune response against targets that are
associated with autoimmunity including cell receptors and cells which produce
self-directed antibodies. T cell mediated autoimmune diseases include rheumatoid arthritis
(RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent
diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing
spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's
granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is
characterized by T cell receptors (TCRs) that bind to endogenous antigens and initiate the
inflammatory cascade associated with autoimmune diseases.

The simian adenoviral vectors of the invention are particularly
well suited for therapeutic regimens in which multiple adenoviral-mediated deliveries of
transgenes are desired, e.g., in regimens involving redelivery of the same transgene or in
combination regimens involving delivery of other transgenes. Such regimens may involve
administration of a Pan5, Pan6, Pan7, SV1, SV25 or SV39 simian adenoviral vector,
followed by re-administration with a vector from the same serotype adenovirus.
Particularly desirable regimens involve administration of a Pan5, Pan6, Pan7, SV1, SV25
or SV39 simian adenoviral vector of the invention, in which the serotype of the viral
vector delivered in the first administration differs from the serotype of the viral vector
utilized in one or more of the subsequent administrations. For example, a therapeutic
regimen involves administration of a Pan5, Pan6, Pan7, SV1, SV25 or SV39 vector and
repeat administration with one or more adenoviral vectors of the same or different
serotypes. In another example, a therapeutic regimen involves administration of an
adenoviral vector followed by repeat administration with a Pan5, Pan6, Pan7, SV1, SV25
or SV39 vector of the invention which differs from the serotype of the first delivered
adenoviral vector, and optionally further administration with another vector which is the
same or, preferably, differs from the serotype of the vector in the prior administration
steps. These regimens are not limited to delivery of adenoviral vectors constructed using
the Pan5, Pan6, Pan7, SV1, SV25 or SV39 simian serotypes of the invention. Rather,
these regimens can readily utilize vectors of other adenoviral serotypes, including, without
limitation, other simian adenoviral serotypes (e.g., Pan9 or C68, C1, etc.), other
non-human primate adenoviral serotypes, or human adenoviral serotypes, in combination with
one or more of the Pan5, Pan6, Pan7, SV1, SV25 or SV39 vectors of the invention.
Examples of such simian, other non-human primate and human adenoviral serotypes are
discussed elsewhere in this document. Further, these therapeutic regimens may involve
either simultaneous or sequential delivery of Pan5, Pan6, Pan7, SV1, SV25, and/or SV39
adenoviral vectors of the invention in combination with non-adenoviral vectors, non-viral
vectors, and/or a variety of other therapeutically useful compounds or molecules. The
present invention is not limited to these therapeutic regimens, a variety of which will be
readily apparent to one of skill in the art.

B. Ad-Mediated Delivery of Immunogenic Transgenes 

The recombinant simian adenoviruses may also be employed as
immunogenic compositions. As used herein, an immunogenic composition is a
composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell)
response is mounted to a transgene product delivered by the immunogenic composition
following delivery to a mammal, and preferably a primate. The present invention
provides a recombinant simian Ad that can contain in any of its adenovirus sequence
deletions a gene encoding a desired immunogen. The simian adenovirus is likely to be
better suited for use as a live recombinant virus vaccine in different animal species
compared to an adenovirus of human origin, but is not limited to such a use. The
recombinant adenoviruses can be used as prophylactic or therapeutic vaccines against any
pathogen for which the antigen(s) crucial for induction of an immune response and able to
limit the spread of the pathogen has been identified and for which the cDNA is available.

Such vaccinal (or other immunogenic) compositions are formulated in a
suitable delivery vehicle, as described above. Generally, doses for the immunogenic
compositions are in the range defined above for therapeutic compositions. The levels of
immunity of the selected gene can be monitored to determine the need, if any, for
boosters. Following an assessment of antibody titers in the serum, optional booster
immunizations may be desired.

Optionally, a vaccinal composition of the invention may be formulated to
contain other components, including, e.g., adjuvants, stabilizers, pH adjusters,
preservatives and the like. Such components are well known to those of skill in the
vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, alum,
monophosphoryl lipid A, and any biologically active factor, such as a cytokine, an
interleukin, a chemokine, a ligand, and optimally combinations thereof. Certain of these
biologically active factors can be expressed in vivo, e.g., via a plasmid or viral vector.
For example, such an adjuvant can be administered with a priming DNA vaccine
encoding an antigen to enhance the antigen-specific immune response compared with the
immune response generated upon priming with a DNA vaccine encoding the antigen
only.

The recombinant adenoviruses are administered in "an immunogenic
amount", that is, an amount of recombinant adenovirus that is effective in a route of
administration to transfect the desired cells and provide sufficient levels of expression of
the selected gene to induce an immune response. Where protective immunity is provided,
the recombinant adenoviruses are considered to be vaccine compositions useful in
preventing infection and/or recurrent disease.

Alternatively, or in addition, the vectors of the invention may contain a
transgene encoding a peptide, polypeptide or protein which induces an immune response
to a selected immunogen. The recombinant adenoviruses of this invention are expected to
be highly efficacious at inducing cytolytic T cells and antibodies to the inserted
heterologous antigenic protein expressed by the vector.

For example, immunogens may be selected from a variety of viral
families. Examples of viral families against which an immune response would
be desirable include the picornavirus family, which includes the genera rhinoviruses,
which are responsible for about 50% of cases of the common cold; the genera
enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and human
enteroviruses such as hepatitis A virus; and the genera aphthoviruses, which are
responsible for foot and mouth diseases, primarily in non-human animals. Within the
picornavirus family of viruses, target antigens include the VP1, VP2, VP3, VP4, and
VPG. Another viral family includes the calicivirus family, which encompasses the
Norwalk group of viruses, which are an important causative agent of epidemic
gastroenteritis. Still another viral family desirable for use in targeting antigens for
inducing immune responses in humans and non-human animals is the togavirus family,
which includes the genera alphavirus, which include Sindbis viruses, Ross River virus,
and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, including Rubella
virus. The flaviviridae family includes dengue, yellow fever, Japanese encephalitis, St.
Louis encephalitis and tick borne encephalitis viruses. Other target antigens may be
generated from the Hepatitis C or the coronavirus family, which includes a number of
non-human viruses such as infectious bronchitis virus (poultry), porcine transmissible
gastroenteric virus (pig), porcine hemagglutinating encephalomyelitis virus (pig), feline
infectious peritonitis virus (cats), feline enteric coronavirus (cat), canine coronavirus
(dog), and human respiratory coronaviruses, which may cause the common cold and/or
non-A, B or C hepatitis. Within the coronavirus family, target antigens include the E1
(also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called HE
or hemagglutinin-esterase) glycoprotein (not present in all coronaviruses), or N
(nucleocapsid). Still other antigens may be targeted against the rhabdovirus family,
which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the genus
lyssavirus (e.g., rabies). Within the rhabdovirus family, suitable antigens may be derived
from the G protein or the N protein. The family filoviridae, which includes hemorrhagic
fever viruses such as Marburg and Ebola virus, may be a suitable source of antigens. The
paramyxovirus family includes parainfluenza Virus Type 1, parainfluenza Virus Type 3,
bovine parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type
2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest,
morbillivirus, which includes measles and canine distemper, and pneumovirus, which
includes respiratory syncytial virus. The influenza virus is classified within the family
orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the N1 protein).
The bunyavirus family includes the genera bunyavirus (California encephalitis, La
Crosse), phlebovirus (Rift Valley Fever), hantavirus (Puumala is a hemorrhagic fever
virus), nairovirus (Nairobi sheep disease) and various unassigned bunyaviruses. The
arenavirus family provides a source of antigens against LCM and Lassa fever virus. The
reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis
in children), orbiviruses, and coltivirus (Colorado Tick fever, Lebombo (humans), equine
encephalosis, blue tongue).

The retrovirus family includes the sub-family oncovirinae, which
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI and
HTLVII, lentivirinae (which includes human immunodeficiency virus (HIV), simian
immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), and equine infectious
anemia virus), and spumavirinae. Among the lentiviruses, many suitable antigens have
been described and can readily be selected. Examples of suitable HIV and SIV antigens
include, without limitation, the gag, pol, Vif, Vpx, VPR, Env, Tat, Nef, and Rev proteins,
as well as various fragments thereof. For example, suitable fragments of the Env protein
may include any of its subunits such as the gp120, gp160, gp41, or smaller fragments
thereof, e.g., of at least about 8 amino acids in length. Similarly, fragments of the tat
protein may be selected. [See, US Patent 5,891,994 and US Patent 6,193,981.] See, also,
the HIV and SIV proteins described in D.R. Barouch et al, J. Virol., 75(5):2462-2467
(March 2001), and R.R. Amara, et al, Science, 292:69-74 (6 April 2001). In another
example, the HIV and/or SIV immunogenic proteins or peptides may be used to form
fusion proteins or other immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef
fusion proteins and immunization regimens described in WO 01/54719, published August
2, 2001, and WO 99/16884, published April 8, 1999. The invention is not limited to the
HIV and/or SIV immunogenic proteins or peptides described herein. In addition, a
variety of modifications to these proteins have been described or could readily be made
by one of skill in the art. See, e.g., the modified gag protein that is described in US Patent
5,972,596. Further, any desired HIV and/or SIV immunogens may be delivered alone or
in combination. Such combinations may include expression from a single vector or from
multiple vectors. Optionally, another combination may involve delivery of one or more
expressed immunogens with delivery of one or more of the immunogens in protein form.
Such combinations are discussed in more detail below.

The papovavirus family includes the sub-family polyomaviruses (BKU
and JCU viruses) and the sub-family papillomavirus (associated with cancers or
malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7,
ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family
includes feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, and
porcine parvovirus. The herpesvirus family includes the sub-family alphaherpesvirinae,
which encompasses the genera simplexvirus (HSVI, HSVII), varicellovirus
(pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, which includes the
genera cytomegalovirus (HCMV, muromegalovirus) and the sub-family
gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV (Burkitt's
lymphoma), infectious rhinotracheitis, Marek's disease virus, and rhadinovirus. The
poxvirus family includes the sub-family chordopoxvirinae, which encompasses the genera
orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus,
capripoxvirus, leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae.
The hepadnavirus family includes the Hepatitis B virus. One unclassified virus which
may be a suitable source of antigens is the Hepatitis delta virus. Still other viral sources
may include avian infectious bursal disease virus and porcine respiratory and
reproductive syndrome virus. The alphavirus family includes equine arteritis virus and
various Encephalitis viruses.

The present invention may also encompass immunogens which
are useful to immunize a human or non-human animal against other pathogens including
bacteria, fungi, parasitic microorganisms or multicellular parasites which infect human
and non-human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial
pathogens include pathogenic gram-positive cocci, which include pneumococci, staphylococci,
and streptococci. Pathogenic gram-negative cocci include meningococcus and gonococcus.
Pathogenic enteric gram-negative bacilli include enterobacteriaceae; pseudomonas,
acinetobacteria and eikenella; melioidosis; salmonella; shigella; Haemophilus; moraxella;
H. ducreyi (which causes chancroid); brucella; Francisella tularensis (which causes
tularemia); yersinia (pasteurella); streptobacillus moniliformis and spirillum.
Gram-positive bacilli include listeria monocytogenes; erysipelothrix rhusiopathiae;
Corynebacterium diphtheriae (diphtheria); cholera; B. anthracis (anthrax); donovanosis
(granuloma inguinale); and bartonellosis. Diseases caused by pathogenic anaerobic
bacteria include tetanus; botulism; other Clostridia; tuberculosis; leprosy; and other
mycobacteria. Pathogenic spirochetal diseases include syphilis; treponematoses: yaws,
pinta and endemic syphilis; and leptospirosis. Other infections caused by higher
pathogen bacteria and pathogenic fungi include actinomycosis; nocardiosis;
cryptococcosis, blastomycosis, histoplasmosis and coccidioidomycosis; candidiasis,
aspergillosis, and mucormycosis; sporotrichosis; paracoccidioidomycosis, petriellidiosis,
torulopsosis, mycetoma and chromomycosis; and dermatophytosis. Rickettsial infections
include Typhus fever, Rocky Mountain spotted fever, Q fever, and Rickettsialpox.
Examples of mycoplasma and chlamydial infections include: mycoplasma pneumoniae;
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections.
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections
produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis;
toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis;
giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or flukes; and
cestode (tapeworm) infections.

Many of these organisms and/or toxins produced thereby have
been identified by the Centers for Disease Control [(CDC), Department of Health and
Human Services, USA], as agents which have potential for use in biological attacks. For
example, some of these biological agents include Bacillus anthracis (anthrax),
Clostridium botulinum and its toxin (botulism), Yersinia pestis (plague), variola major
(smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers [filoviruses
(e.g., Ebola, Marburg), and arenaviruses (e.g., Lassa, Machupo)], all of which are
currently classified as Category A agents; Coxiella burnetii (Q fever), Brucella species
(brucellosis), Burkholderia mallei (glanders), Burkholderia pseudomallei (melioidosis),
Ricinus communis and its toxin (ricin toxin), Clostridium perfringens and its toxin
(epsilon toxin), Staphylococcus species and their toxins (enterotoxin B), Chlamydia
psittaci (psittacosis), water safety threats (e.g., Vibrio cholerae, Cryptosporidium parvum),
Typhus fever (Rickettsia prowazekii), and viral encephalitis (alphaviruses, e.g.,
Venezuelan equine encephalitis; eastern equine encephalitis; western equine
encephalitis), all of which are currently classified as Category B agents; and Nipah virus
and hantaviruses, which are currently classified as Category C agents. In addition, other
organisms, which are so classified or differently classified, may be identified and/or used
for such a purpose in the future. It will be readily understood that the viral vectors and
other constructs described herein are useful to deliver antigens from these organisms,
viruses, their toxins or other by-products, which will prevent and/or treat infection or
other adverse reactions with these biological agents.

Administration of the vectors of the invention to deliver
immunogens against the variable region of the T cells elicits an immune response
including CTLs to eliminate those T cells. In RA, several specific variable regions of
TCRs which are involved in the disease have been characterized. These TCRs include
Vβ-3, Vβ-14, Vβ-17 and Vα-17. Thus, delivery of a nucleic acid sequence that encodes at
least one of these polypeptides will elicit an immune response that will target T cells
involved in RA. In MS, several specific variable regions of TCRs which are involved in
the disease have been characterized. These TCRs include Vβ-7 and Vα-10. Thus, delivery
of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an
immune response that will target T cells involved in MS. In scleroderma, several
specific variable regions of TCRs which are involved in the disease have been
characterized. These TCRs include Vβ-6, Vβ-8, Vβ-14 and Vα-16, Vα-3C, Vα-7, Vα-14,
Vα-15, Vα-16, Vα-28 and Vα-12. Thus, delivery of a recombinant simian adenovirus that
encodes at least one of these polypeptides will elicit an immune response that will target
T cells involved in scleroderma.

C. Ad-Mediated Delivery Methods

The therapeutic levels, or levels of immunity, of the selected gene can be
monitored to determine the need, if any, for boosters. Following an assessment of CD8+
T cell response, or optionally, antibody titers, in the serum, optional booster
immunizations may be desired. Optionally, the recombinant simian adenoviral vectors of
the invention may be delivered in a single administration or in various combination
regimens, e.g., in combination with a regimen or course of treatment involving other
active ingredients or in a prime-boost regimen. A variety of such regimens have been
described in the art and may be readily selected.

For example, prime-boost regimens may involve the administration of a
DNA (e.g., plasmid) based vector to prime the immune system to a second, booster
administration with a traditional antigen, such as a protein or a recombinant virus carrying
the sequences encoding such an antigen. See, e.g., WO 00/11140, published March 2,
2000, incorporated by reference. Alternatively, an immunization regimen may involve
the administration of a recombinant simian adenoviral vector of the invention to boost the
immune response to a vector (either viral or DNA-based) carrying an antigen, or a
protein. In still another alternative, an immunization regimen involves administration of a
protein followed by booster with a vector encoding the antigen.
In one embodiment, the invention provides a method of priming and
boosting an immune response to a selected antigen by delivering a plasmid DNA vector
carrying said antigen, followed by boosting with a recombinant simian adenoviral vector
of the invention. In one embodiment, the prime-boost regimen involves the expression
of multiproteins from the prime and/or the boost vehicle. See, e.g., R.R. Amara, Science,
292:69-74 (6 April 2001) which describes a multiprotein regimen for expression of
protein subunits useful for generating an immune response against HIV and SIV. For
example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr and Env, Tat, and
Rev from a single transcript. Alternatively, the SIV Gag, Pol and HIV-1 Env are delivered
in a recombinant adenovirus construct of the invention. Still other regimens are described
in WO 99/16884 and WO 01/54719.

However, the prime-boost regimens are not limited to immunization for
HIV or to delivery of these antigens. For example, priming may involve delivery of a
first chimp vector of the invention followed by boosting with a second chimp vector, or
with a composition containing the antigen itself in protein form. In one example, the
prime-boost regimen can provide a protective immune response to the virus, bacteria or
other organism from which the antigen is derived. In another desired embodiment, the
prime-boost regimen provides a therapeutic effect that can be measured using conventional
assays for detection of the presence of the condition for which therapy is being
administered.

The priming composition may be administered at various sites in the body
in a dose dependent manner, which depends on the antigen to which the desired immune
response is being targeted. The invention is not limited to the amount or situs of
injection(s) or to the pharmaceutical carrier. Rather, the regimen may involve a priming
and/or boosting step, each of which may include a single dose or dosage that is
administered hourly, daily, weekly or monthly, or yearly. As an example, the mammals
may receive one or two doses containing between about 10 µg to about 50 µg of plasmid
in carrier. A desirable amount of a DNA composition ranges between about 1 µg to about
10,000 µg of the DNA vector. Dosages may vary from about 1 µg to 1000 µg DNA per
kg of subject body weight. The amount or site of delivery is desirably selected based
upon the identity and condition of the mammal.
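
As a worked illustration of the per-kilogram range stated above (about 1 µg to 1000 µg of DNA per kg of body weight), the arithmetic can be sketched as follows. This is only a sketch: the function name and the 70 kg example weight are illustrative assumptions and do not appear in this specification.

```python
def dna_dose_range_ug(body_weight_kg, low_ug_per_kg=1.0, high_ug_per_kg=1000.0):
    """Return the (minimum, maximum) total DNA amount in micrograms.

    The default bounds restate the 1 to 1000 ug/kg range given in the text;
    the body weight is supplied by the caller.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_ug_per_kg * body_weight_kg, high_ug_per_kg * body_weight_kg)


# Hypothetical 70 kg subject (an assumption chosen only for illustration).
low, high = dna_dose_range_ug(70.0)
print(f"{low:.0f} ug to {high:.0f} ug of DNA")
```

The sketch only scales the stated range by weight; it does not select a dose, which the text leaves to the identity and condition of the mammal.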

The dosage unit of the vector suitable for delivery of the antigen to the
mammal is described herein. The vector is prepared for administration by being
suspended or dissolved in a pharmaceutically or physiologically acceptable carrier such as
isotonic saline; isotonic salts solution or other formulations that will be apparent to those
skilled in such administration. The appropriate carrier will be evident to those skilled in
the art and will depend in large part upon the route of administration. The compositions of
the invention may be administered to a mammal according to the routes described above,
in a sustained release formulation using a biodegradable biocompatible polymer, or by
on-site delivery using micelles, gels and liposomes. Optionally, the priming step of this
invention also includes administering with the priming composition, a suitable amount of
an adjuvant, such as are defined herein.

Preferably, a boosting composition is administered about 2 to about 27
weeks after administering the priming composition to the mammalian subject. The
administration of the boosting composition is accomplished using an effective amount of
a boosting composition containing or capable of delivering the same antigen as
administered by the priming DNA vaccine. The boosting composition may be composed
of a recombinant viral vector derived from the same viral source (e.g., adenoviral
sequences of the invention) or from another source. Alternatively, the "boosting
composition" can be a composition containing the same antigen as encoded in the
priming DNA vaccine, but in the form of a protein or peptide, which composition induces
an immune response in the host. In another embodiment, the boosting composition
contains a DNA sequence encoding the antigen under the control of a regulatory sequence
directing its expression in a mammalian cell, e.g., vectors such as well-known bacterial or
viral vectors. The primary requirement of the boosting composition is that the antigen
of the composition is the same antigen, or a cross-reactive antigen, as that encoded by the
priming composition.

In another embodiment, the simian adenoviral vectors of the invention are
also well suited for use in a variety of other immunization and therapeutic regimens.
Such regimens may involve delivery of simian adenoviral vectors of the invention
simultaneously or sequentially with Ad vectors of different serotype capsids, regimens in
which adenoviral vectors of the invention are delivered simultaneously or sequentially
with non-Ad vectors, regimens in which the adenoviral vectors of the invention are
delivered simultaneously or sequentially with proteins, peptides, and/or other biologically
useful therapeutic or immunogenic compounds. Such uses will be readily apparent to one
of skill in the art.

The following examples illustrate the cloning of the simian adenoviruses
and the construction of exemplary recombinant adenovirus vectors of the present
invention. These examples are illustrative only, and do not limit the scope of the present
invention.

Example 1 - Viral Propagation 

The Pan5 [ATCC Accession No. VR-591], Pan6 [ATCC Accession No.
VR-592], and Pan7 [ATCC Accession No. VR-593] viruses, originally isolated from
lymph nodes from chimpanzees, were propagated in 293 cells [ATCC CRL1573].
Typically, these cells are cultured in Dulbecco's Modified Eagle's Medium (DMEM;
Sigma, St. Louis, MO.) supplemented with 10% fetal calf serum (FCS) [Sigma or
Hyclone, Logan, UT] and 1% Penicillin-Streptomycin (Sigma). Infection of 293 cells is
carried out in DMEM supplemented with 2% FCS for the first 24 hours, after which FCS
is added to bring the final concentration to 10%. Infected cells are harvested when 100%
of the cells exhibit virus-induced cytopathic effect (CPE), and are then collected, and
concentrated by centrifugation. Cell pellets are resuspended in 10 mM Tris (pH 8.0), and
lysed by 3 cycles of freezing and thawing. Virus preparations are obtained following two
ultracentrifugation steps on cesium chloride density gradients and stocks of virus are
diluted to 1 to 5 x 10^12 particles/ml in 10 mM Tris/100 mM NaCl/50% glycerol and stored
at -70°C.

The ability of 293 cells to propagate these adenoviruses exceeded expectations
which were based on knowledge of other non-human adenovirus serotypes.

Virus     Yield (virus particles produced in 8 x 10^8 cells)
Pan5      8.8 x 10^13
Pan6      1.6 x 10^14
Pan7      8.8 x 10^13
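
To put the yields in the table above on a per-cell basis, the following minimal sketch divides each reported total by the number of producer cells. It assumes the cell count in the table header reads 8 x 10^8 (the superscript is damaged in the source text), and the variable and function names are illustrative only.

```python
# Total virus particles per preparation, as listed in the yield table.
TOTAL_PARTICLES = {
    "Pan5": 88 * 10**12,   # 8.8 x 10^13
    "Pan6": 16 * 10**13,   # 1.6 x 10^14
    "Pan7": 88 * 10**12,   # 8.8 x 10^13
}

# Assumed number of 293 cells per preparation (8 x 10^8, see lead-in).
CELLS_PER_PREP = 8 * 10**8


def particles_per_cell(total_particles, cells=CELLS_PER_PREP):
    """Return the average number of virus particles produced per cell."""
    return total_particles / cells


for virus, total in TOTAL_PARTICLES.items():
    print(f"{virus}: {particles_per_cell(total):.1e} particles per cell")
```

Under that assumption the yields correspond to roughly 1 x 10^5 to 2 x 10^5 particles per cell.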

Example 2 - Characterization of Viral Genomic DNA 
Genomic DNA was isolated from the purified virus preparations of
Example 1 and digested with HindIII or BamHI restriction enzymes following the
manufacturers' recommendations. The results (not shown) revealed that the Pan5,
Pan6, Pan7 genomes of the invention and the published Pan9 (C68) genome show
different restriction patterns, and thus, are distinct from each other.

The nucleotide sequences of Pan5, Pan6 and Pan7 were determined. The
nucleotide sequence of the top strand of Pan5 DNA is reported in SEQ ID NO: 1. The
nucleotide sequence of the top strand of Pan6 DNA is reported in SEQ ID NO: 5. The
nucleotide sequence of the top strand of Pan7 DNA is reported in SEQ ID NO: 9.

Regulatory and coding regions in the viral DNA sequences were
identified by homology to known adenoviral sequences using the "Clustal W" program
described above at conventional settings. See the tables above providing the adenoviral
sequences. Open reading frames were translated and the predicted amino acid sequences
examined for homology to previously described adenoviral protein sequences, Ad4, Ad5,
Ad7, Ad12, and Ad40.

Analysis of the sequence revealed a genome organization that is similar to
that present in human adenoviruses, with the greatest similarity to human Ad4. However,
substantial differences in the hexon hypervariable regions were noted between the
chimpanzee adenoviruses and other known adenoviruses, including AdHu4. These
differences fit well with the serological cross-reactivity data that has been obtained (see
below).

An alignment of a portion of the hexon sequences is shown in Fig. 1. The
portion shown is from the region of the hexon that corresponds to the outwardly disposed
extended loops DE1 and FG1 where the most variability between serotypes is observed.

An intervening portion that contributes to the base of the hexon (corresponding to 
residues 308-368 of the published AdC68 sequence; US Patent 6,083,716), and is highly 
conserved between serotypes, is also present. The following table summarizes the
pair-wise comparisons of the amino acids in the hexon proteins.

Comparison              Similarity (%)
#1        #2
AdC5      AdC7          99.0
AdC5      AdC68         98.3
AdC5      AdC6          88.0
AdC5      AdC1          84.9
AdC6      AdC7          87.7
AdC6      AdC68         87.3
AdC6      AdC1          84.9
AdC7      AdC68         97.5
AdC7      AdC1          84.8
AdC68     AdC1          84.9


Analysis of the fiber knob domain (which is responsible for receptor
binding) of the chimpanzee adenoviruses shows an overall similarity in structure (Fig. 2).
The degree of sequence similarity between the E1 proteins of huAd5 and
C68 (see Tables below) is similar to that between huAd5 and Pan-5, Pan-6, and Pan-7.

Comparison              E1a (13S) amino-acid
#1        #2            identity (%)
AdHu5     AdC5          36.6
AdHu5     AdC6          28.5
AdHu5     AdC7          34.9
AdHu5     AdC68         35.6
AdHu5     AdC1          35.6
AdC5      AdC6          68.3
AdC5      AdC7          96.9
AdC5      AdC68         80.4
AdC5      AdC1          51.3
AdC6      AdC7          69.3
AdC6      AdC68         59.4
AdC6      AdC1          37.7
AdC7      AdC68         81.5
AdC7      AdC1          51.0
AdC68     AdC1          54.9
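
The pairwise percentages tabulated above are identity scores between aligned protein sequences. The text does not state the exact scoring convention used (for example, how gapped positions were treated), so the following is only a sketch of one common convention: identical residues divided by aligned, non-gap positions. The function name and the two short sequences are invented for illustration and are not taken from the sequence listing.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap.

    Both inputs must already be aligned (equal length, gaps as `gap`),
    e.g. rows taken from a multiple-alignment output.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip gapped columns (an assumed convention)
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, for illustration only).
seq_1 = "MRHLRDLPDE"
seq_2 = "MRHLR-LPGE"
print(f"{percent_identity(seq_1, seq_2):.1f}% identity")
```

Other conventions (counting gaps as mismatches, or normalizing by the shorter sequence) would give different values for the same alignment, which is one reason identity figures from different sources are not always directly comparable.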





Sequence Identity with human Ad5

          E1b Small T Protein     E1b Large T Protein
C68       47.3%                   55.8%
Pan-5     43.2%                   54.5%
Pan-6     45.3%                   54.5%
Pan-7     46.4%                   53.8%

Replication-defective versions of AdC5, AdC6 and AdC7 were created by
molecular cloning methods described in the following examples in which minigene
cassettes were inserted into the place of the E1a and E1b genes. The molecular clones of
the recombinant viruses were rescued and grown up in 293 cells for large-scale
purification using the published CsCl sedimentation method [K. Fisher et al, J. Virol.,
70:520 (1996)]. Vector yields were based on 50 plate (150 mm) preps in which
approximately 1 x 10^9 293 cells were infected with the corresponding viruses. Yields
were determined by measuring viral particle concentrations spectrophotometrically. After
having constructed E1-deleted vectors, it was determined that HEK 293 cells (which
express human adenovirus serotype 5 E1 functions) trans-complement the E1 deletions of
the novel viral vectors and allow for the production of high titer stocks. Examples of
virus yields for a few of these recombinant viruses are shown in the table below.

The transgenes for these vectors, β-galactosidase (LacZ), green
fluorescent protein (GFP), alpha-1-antitrypsin (A1AT), ebola glycoprotein (ebo), a
soluble ebola glycoprotein variant lacking the transmembrane and cytoplasmic domains
(sEbo), and three deletion mutants of the ebola glycoprotein (EboΔ2, EboΔ3, and
EboΔ4), were expressed by the cytomegalovirus promoter (CMV). In the following table,
ND indicates that the study has not yet been done.



               Viral backbone/Vector yield (Viral particles x 10^13)
Transgene      AdHu5    AdC7    AdC68    AdC6
CMVLacZ        1.5      1.4     3.3      6.1
CMVGFP         2.5      3.6     8        10
CMVA1AT        3.7      6       10       ND
CMVEbo         1.1      4.3     ND       ND
CMVsEbo        4.9      5.4     ND       ND
CMVEboΔ2       1        9.3     ND       ND
CMVEboΔ3       0.8      9.5     ND       ND
CMVEboΔ4       1.4      6.2     ND       ND






WO 03/046124 



PCT/US02/33645 



The ability of human adenovirus E1 to trans-complement the E1-deleted
chimpanzee viruses of the invention is highly advantageous, as it permits the production
of E1-deleted chimpanzee adenoviral vectors of the invention, while reducing or
eliminating the risk of homologous recombination due to the differences in sequences
between human Ad and the chimpanzee adenoviruses described herein.

Example 3 - Serological Studies of Pan 5, 6, and 7 Viruses 

Because of the differences in the hexon hypervariable region, it was anticipated
that the C5, C6, and C7 viruses would be serologically distinct from human adenoviruses,
including AdHu4.

1. Cross-Reactivity of Wild-type Viruses

To screen the wild-type viruses for antibody cross-reactivity, the
replication-competent viruses were used and inhibition of
cytopathic effects (CPE) was measured. Briefly, preparations of adenoviruses (Adhu5,
Pan-5, Pan-6, Pan-7 and AdC68) stored at 5 x 10^12 particles/ml were diluted 1/600 for the
assays. This concentration of virus was selected since it results in 100% CPE within 48
hours in the absence of neutralization. Prior to adding the virus to 293 cells (4 x 10^4
cells/well in a 96-well dish), 1:20 dilutions of sera were added. The assay is read as the
presence or absence of CPE; full neutralization would read as no cytopathic effect. The
results are summarized in the Table below. The fact that 9/36 human sera neutralized
Adhu5-induced CPE is consistent with previous estimates of neutralizing antibodies in the
human population. The numbers indicate the total individuals who showed neutralization
(numerator) versus the total number screened (denominator). ND = not determined.

Neutralization by 1/20 diln of serum

           Human     Rhesus    Chimpanzee
           (N=36)    (N=52)    (N=20)
Adhu5      9/36      ND        ND
AdC68      1/36      0/52      12/20
Pan 5      0/36      0/52      10/20
Pan 6      0/36      0/52      9/20
Pan 7      0/36      0/52      12/20
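
The numerator/denominator entries of the neutralization table can be converted to percentages as in the following minimal sketch. The counts are copied from the table; the dictionary layout and function name are illustrative assumptions only.

```python
from fractions import Fraction

# (sera showing neutralization, sera screened), copied from the table.
# None marks an entry reported as ND (not determined).
NEUTRALIZATION = {
    "Adhu5": {"human": (9, 36), "rhesus": None, "chimpanzee": None},
    "AdC68": {"human": (1, 36), "rhesus": (0, 52), "chimpanzee": (12, 20)},
    "Pan 5": {"human": (0, 36), "rhesus": (0, 52), "chimpanzee": (10, 20)},
    "Pan 6": {"human": (0, 36), "rhesus": (0, 52), "chimpanzee": (9, 20)},
    "Pan 7": {"human": (0, 36), "rhesus": (0, 52), "chimpanzee": (12, 20)},
}


def percent_neutralizing(virus, species):
    """Percentage of screened sera that neutralized the virus, or None for ND."""
    cell = NEUTRALIZATION[virus][species]
    if cell is None:
        return None
    positive, screened = cell
    return float(Fraction(positive, screened) * 100)


if __name__ == "__main__":
    for virus, row in NEUTRALIZATION.items():
        for species in row:
            print(virus, species, percent_neutralizing(virus, species))
```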
Of all human sera screened, 35/36 were negative for neutralization to AdC68
while 36/36 were negative for neutralization to Pan-5, Pan-6 and Pan-7. Of 52 rhesus
monkeys screened, none showed neutralization to any chimpanzee adenovirus; rhesus
monkey is the preferred pre-clinical model for evaluating HIV vaccines. Between 9 and 12
out of 20 chimpanzees had substantial neutralization to one or another of the chimpanzee
adenoviruses, consistent with the fact that these are indeed endemic chimpanzee-specific
pathogens. Interestingly, there are chimpanzees with neutralizing antibodies only to
Pan-5, Pan-6 or AdC68, supporting the hypothesis that several of these chimpanzee adenoviral
vectors will not cross-neutralize each other and are distinct serotypes.

The same assay was carried out for 20 chimpanzee serum samples. Fifty percent
(50%) of the samples reacted serologically, to different degrees, to Pan5; 40% to Pan6;
55% to Pan7 and 60% to C68. Among the positive serum samples, one had
strong neutralizing activity to all four chimp viruses.

2. Cross-neutralization with Recombinant Viruses

High-titer polyclonal antibodies were obtained to each of the simian
adenoviruses in order to more precisely gauge the degree of cross-neutralization among
the different serotypes. This was done by intramuscular immunization of rabbits using a
recombinant virus containing GFP based on the previously described C68 chimpanzee
adenovirus as an adjuvant. The serum was then used to assay for neutralizing activity
against each of the three chimpanzee adenoviruses of the invention, AdC5, AdC6 and
AdC7. A rabbit was injected with 5 x 10^12 viral particles per kg of C68CMVGFP vector
intramuscularly and boosted 5 weeks later using the same dose. A bleed collected at the
9 week time point revealed extremely potent neutralizing activity against C68 as well as
Pan-5 and Pan-7 but not against Pan-6 (see Table below), indicating that the
administration of a C68 (or Pan-5 and Pan-7) based vaccine could be effectively followed
by a boost using a vector based on Pan-6. However, it has been found that this level of
inter-relatedness does not necessarily prevent re-administration in a setting where
antiviral antibody titers are not as high as was achieved in this rabbit. In the following
table, + indicates 33% CPE; ++ indicates 66% CPE; +++ indicates 100% CPE.
Infection on 293 cells with virus:

Serum
Dilution       Pan5    Pan6    Pan7    Pan9(C68)    C68 GFP
1/20                   +++     -                    -
1/40           -       +++     -       -            -
1/80           -       +++     -       -            -
1/160          -       +++     -       -
1/320          -       +++     -       -            -
1/640          -       +++     -       -            -
1/1,280        -       +++     -       -            -
1/2,560        -       +++     -       -            -
1/5,120        -       +++     -       -            -
1/10,240       +       +++     -       -            -
1/20,480       +       +++     ++      -            -
1/40,960       ++      +++     +++
1/81,920       ++      +++     +++     +            +
1/163,840      +++     +++     +++                  ++
1/327,680      +++     +++     +++     +++          +++
1/655,360      +++     +++     +++     +++          +++
1/1,310,720    +++     +++     +++     +++          +++
1/2,621,440    +++     +++     +++     +++          +++
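
The serum dilutions in the table form a two-fold series starting at 1/20. A minimal sketch for regenerating the row labels follows; the function names are illustrative, and the step count of 18 is simply the number of rows in the table.

```python
def twofold_dilution_series(first_reciprocal=20, steps=18):
    """Reciprocal dilutions of a two-fold series: 20, 40, 80, ..."""
    return [first_reciprocal * 2 ** k for k in range(steps)]


def format_dilution(reciprocal):
    """Render a reciprocal dilution in the table's style, e.g. 1/5,120."""
    return f"1/{reciprocal:,}"


if __name__ == "__main__":
    for reciprocal in twofold_dilution_series():
        print(format_dilution(reciprocal))
```

Each label is exactly twice the previous one, which gives a quick check on any individual row label.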
3. Quantitative Assay for Detection of Neutralizing Antibody

The result was validated by a more quantitative assay for detecting
neutralizing antibody, which is based on transduction of a GFP vector. Briefly, groups of
C57BL/6 mice were immunized intramuscularly or intravenously with 5.0 x 10^10
particles/ml Pan5, Pan6, Pan7 or C68. Sera from day 28 bleeds were tested for
cross-neutralizing activity against C68CMVEGFP at dilutions of 1/20 and 1/80. In summary,
when a pharmaceutical preparation of human immunoglobulin was tested for serological
reactions to Pan 5, 6, and 7, and C68, some low levels of neutralizing activities against
Pan 7 and C68 were detected. No neutralizing activity against Pan5 or Pan6 was
detected. Serum samples from 36 human subjects were run in the same assay. Serum
samples were tested at a 1/20 dilution. The results indicated that only one individual had
clear neutralizing activity to C68. No neutralizing activity to Pan5, Pan6 or Pan7 was
detected.

4. In Vitro Cross-Neutralization

Cross-neutralization of the simian adenoviruses by high-titer rabbit
polyclonal antibodies raised against each of the adenoviruses Pan-5, Pan-6, Pan-7, and
C68 was tested.

Rabbits were immunized with intramuscular injections of 10^13 particles
of each of the chimpanzee adenoviruses and boosted 40 days later with the same dose
with incomplete Freund's adjuvant. Sera were analyzed for the presence of neutralizing
antibodies by incubating serial two-fold dilutions with 10^9 genome copies of each of the
appropriate chimpanzee adenovirus vectors expressing GFP and testing for the attenuation
of GFP expression when applied to 293 cells. The serum dilution which produced a 50%
reduction of GFP expression was scored as the neutralizing antibody titer against that
particular virus.
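
The 50% scoring rule can be sketched as follows. This is one possible reading, assumed here for illustration: the titer is taken as the highest dilution at which GFP expression is still reduced to 50% or less of the no-serum control. The readings in the example are hypothetical and are not data from the study.

```python
def neutralizing_titer(gfp_by_dilution, control_gfp, threshold=0.5):
    """Return the highest reciprocal dilution whose GFP signal is at or below
    threshold * control, or None if no tested dilution neutralizes that well.

    gfp_by_dilution maps reciprocal dilution (20, 40, ...) to a GFP reading.
    """
    if control_gfp <= 0:
        raise ValueError("control_gfp must be positive")
    cutoff = threshold * control_gfp
    neutralizing = [d for d, gfp in gfp_by_dilution.items() if gfp <= cutoff]
    return max(neutralizing) if neutralizing else None


if __name__ == "__main__":
    # Hypothetical GFP readings for a two-fold dilution series; control = 1000.
    readings = {20: 10, 40: 50, 80: 200, 160: 480, 320: 700, 640: 950}
    titer = neutralizing_titer(readings, control_gfp=1000)
    print("titer: <1/20" if titer is None else f"titer: 1/{titer}")
```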

The results are shown in the Table. The data are consistent with the 
expectation from sequence analysis of the hexon amino-acid sequences, which indicated 
that Ad Pan-6 was likely to be the most serologically distinct compared to the other 
chimpanzee adenoviruses. 








                     Infection of 293 cells with 10^9 genome copies of
Serum from rabbit
immunized with:      Ad Pan-5             Ad Pan-6    Ad Pan-7     Ad C68
Ad Pan-5             1/5120               <1/20       1/2560       1/2560
Ad Pan-6             No neutralization    1/20,480    <1/20        <1/20
Ad Pan-7             1/2560               1/160       1/163,840    1/2560
Ad C68               No neutralization    <1/20       <1/20        1/5120



In order to determine whether antibodies cross-reacting with the simian
adenoviruses were likely to be of low prevalence in humans, simian adenoviruses SV1,
SV39, and SV25 were tested for their ability to withstand neutralization when incubated
with commercially available pooled human immunoglobulins (Ig). The same assay was
also performed with Adhu5 and the chimpanzee adenoviruses Pan-5, Pan-6, Pan-7, and
C68. In a further study, sera from mice that had been immunized with one of the chimpanzee
adenoviruses C5, C6, C7, and C68 were tested for their ability to cross-neutralize the simian
adenoviruses SV-15, SV-23, SA-17, and Baboon Adenovirus. No cross-reactivity
was observed in any case.

Example 4 - Generation of Recombinant E1-Deleted Pan5 Vector

A modified pX plasmid was prepared by destroying the FspI site in the bla
gene region of pX (Clontech) by site-directed mutagenesis. The resulting modified
plasmid, termed pX', is a circular plasmid of 3000 bp which contains an f1 ori and an
ampicillin resistance gene (AmpR-cds).

A. Production of Pan-5 Adenovirus Plasmid

A polylinker for sequential cloning of the Pan5 DNA fragments
into pX' is created. The polylinker is substituted for the existing pX' polylinker
following digestion with MluI and EcoRI. The blunt-FseI fragment of Pan 5 is
inserted into the SmaI and FseI sites of the polylinker. This fragment contains the 5' end
of the adenoviral genome (bp 1 to 3606, SEQ ID NO:1). The SnaBI-FspI fragment of
Pan 5 (bp 455 to 3484, SEQ ID NO:1) is replaced with a short sequence flanked by I-CeuI
and PI-SceI sites from pShuttle (Clontech), to eliminate the E1 region of the adenoviral
genome. The EcoRI-blunt fragment of Pan5 (bp 28658 to 36462, SEQ ID NO:1) is
inserted into the EcoRI and EcoRV sites of the polylinker (to provide the 3' end of the
adenoviral genome); the FseI-MluI fragment (bp 3606 to 15135, SEQ ID NO:1) is
inserted into the polylinker; and the MluI-EcoRI fragment (bp 15135 to 28658,
SEQ ID NO:1) is inserted into the polylinker. Optionally, a desired transgene is inserted
into the I-CeuI and PI-SceI sites of the newly created pX'Pan5ΔE1 vector.
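
As a consistency check on the coordinates quoted above, the following sketch confirms that the four Pan5 fragments abut one another end to end and reports the size of the replaced E1 interval. The coordinates are those given in the text; the names and the convention of measuring length as the difference between the two coordinates are assumptions made for illustration.

```python
# (fragment name, start bp, end bp) in SEQ ID NO:1 coordinates, from the text.
PAN5_FRAGMENTS = [
    ("blunt-FseI (5' end)", 1, 3606),
    ("FseI-MluI", 3606, 15135),
    ("MluI-EcoRI", 15135, 28658),
    ("EcoRI-blunt (3' end)", 28658, 36462),
]

# SnaBI-FspI interval replaced by the I-CeuI/PI-SceI cassette.
E1_REPLACED = (455, 3484)


def fragments_are_contiguous(fragments):
    """True if each fragment starts at the coordinate where the previous ends."""
    ordered = sorted(fragments, key=lambda f: f[1])
    return all(prev[2] == nxt[1] for prev, nxt in zip(ordered, ordered[1:]))


def covered_span(fragments):
    """Lowest start and highest end coordinate across all fragments."""
    return min(f[1] for f in fragments), max(f[2] for f in fragments)


def replaced_length(region):
    """Distance between the two coordinates bounding the replaced region."""
    start, end = region
    return end - start


if __name__ == "__main__":
    print(fragments_are_contiguous(PAN5_FRAGMENTS))
    print(covered_span(PAN5_FRAGMENTS))
    print(replaced_length(E1_REPLACED))
```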
B. Alternative Method of Generating pX'Pan5ΔE1

The initial plasmid pX is derived from the pAdX adenovirus
plasmid available from Clontech, as described above. Thereafter, a PacI-XhoI region
of pX' was deleted and the blunt-ended Pan5 polylinker was inserted into the FspI site
to generate pX'PLNK (2994 bp). The 5' end-FseI region of Pan 5 (bp 1-3607, SEQ
ID NO:1) was inserted into the SmaI and FseI sites of pX'LNK to generate the pX'Pan5-5'
plasmid (6591 bp). The SnaBI-NdeI region of pX'Pan5-5' was excised and
replaced with the Ceu/Sce cassette, which had been PCR amplified from pRCS, to
create pX'Pan5-5'ΔE1 (4374 bp). Briefly, a sequence containing I-CeuI and PI-SceI
rare cutter sites was PCR amplified from pRCS (3113 bp). The 3' PCR primer
introduced an NdeI site into the PCR product.

To extend the Pan5 DNA in pX'Pan5-5'ΔE1 (4374 bp), the
FseI-MluI region of Pan 5 (bp 3607-15135, SEQ ID NO:1) is added, to create
pX'Pan5-5'Mlu (15900 bp). The remaining MluI-3' end of the Pan5 sequence (bp
15135-36462, SEQ ID NO:1) is added to the vector between the MluI and EcoRV
sites of the vector polylinker to form pX'Pan5ΔE1, which contains the full-length
Pan5 sequence containing a deletion in the E1 region.

C. Generation of Recombinant Viruses

To generate the recombinant adenoviruses from pX'Pan5ΔE1,
the plasmid is co-transfected with a helper expressing E1, or is transfected into an
E1-expressing packaging cell line, such as the 293 cell line or a cell line prepared as
described herein. The expression of E1 in the packaging cell permits the replication and
packaging of the Pan5ΔE1 into a virion capsid. In another embodiment, the packaging cell
transfected with pX'Pan5ΔE1 is transfected with an adenovirus vector as described
above bearing the transgene of interest. Homologous recombination occurs between
the helper and the plasmid, which permits the adenovirus-transgene sequences in the
vector to be replicated and packaged into virion capsids, resulting in the recombinant
adenovirus.

Transfection is followed by an agar overlay for 2 weeks, after
which the viruses are plaqued, expanded and screened for expression of the transgene.
Several additional rounds of plaque purification are followed by another expansion of
the cultures. Finally the cells are harvested, a virus extract is prepared and the
recombinant chimpanzee adenovirus containing the desired transgene is purified by
buoyant density ultracentrifugation in a CsCl gradient or by alternative means known
to those of skill in the art.



Example 5 - Generation of Recombinant E1-Deleted Pan6 Vector

A. Strategy for Construction of Pan-6 Adenoviral Plasmid

1. Cloning of terminal fragments

Pan 6 virus is deproteinated by pronase and proteinase K
treatment and phenol extraction. Synthetic 12 bp Pme I linkers are ligated onto the viral
DNA as described by Berkner and Sharp, Nucleic Acids Research, 11: 6003 (1983). The
viral DNA is then digested with Xba I to isolate a 5' terminal fragment (6043 bp). The
Ad6 Xba I 5' fragment is then ligated into pX link at Sma I and Xba I sites to form
pX-AdPan6-0-16.5. The viral DNA with Pme I linkers is also digested with Pac I to isolate
the 6475 bp 3' terminal fragment, which is cloned into pX link at Pac I and Sma I sites,
resulting in pXAdPan6-82-100.

2. Deletion of E1 from the 5' clone

To delete E1 (m.u. 1.2-9), the BsiWI-Xba I fragment in
pX-AdPan6-0-16.5 is replaced with a PCR fragment spanning m.u. 9-16.7 that has been
treated with BsiWI and Xba I, leading to pX-Ad-Pan6 m.u.0-1, 9-16.5.
3. Fusion of 5' and 3' clones to create an anchor site to
accept the middle Hind III fragment

First, the 5' clone, pX-Ad-Pan6 m.u.0-1, 9-16.5, is further
expanded by inserting the 2nd Xba I fragment (4350 bp, m.u.16.5 - 28) from the Pan 6
genome into the Xba I site in pX-Ad-Pan6 m.u.0-1, 9-16.5. This construct is named
pXAd-Pan6-mu 0-1, 9-28.

Second, the 3' clone is also expanded by inserting the 15026 bp
Mlu I/Pac I fragment covering m.u.41-82 from the Pan 6 genome into the Mlu I/Pac I sites of
pXAdPan6-82-100, generating pXAdPan6-m.u.41-100.

Then, an 8167 bp Hind III/Eco 47III Pan 6 fragment is isolated
from pXAd-Pan6-mu 0-1, 9-28 and subcloned into pXAdPan6-m.u.41-100 at Hind III and
Xba I blunt sites. This 5' and 3' fusion clone is called pXAdPan6mu0-1, 9-19.5, 64-100.

4. Drop of the middle fragment of the genome into the fusion
clone

A 16335 bp Hind III fragment (m.u.19.5 - 64) from Pan 6 is
inserted into the Hind III site of pXAdPan6mu0-1, 9-19.5, 64-100 to form
pXAdPan6-0-1, 9-100.
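
The map unit (m.u.) ranges quoted for the Pan 6 fragments are approximate. Taking one map unit as 1% of the genome length, each fragment size and m.u. span implies a genome length, as in the sketch below. The fragment sizes and spans come from the text; the function names are illustrative, and the differing results simply reflect rounding of the m.u. values.

```python
# (description, size in bp, start m.u., end m.u.) as quoted in the text.
PAN6_FRAGMENTS = [
    ("2nd Xba I fragment", 4350, 16.5, 28.0),
    ("Mlu I/Pac I fragment", 15026, 41.0, 82.0),
    ("middle Hind III fragment", 16335, 19.5, 64.0),
]


def implied_genome_length(size_bp, start_mu, end_mu):
    """Genome length (bp) implied by a fragment spanning start_mu..end_mu."""
    span = end_mu - start_mu
    if span <= 0:
        raise ValueError("end_mu must be greater than start_mu")
    return size_bp * 100.0 / span


def mu_to_bp(mu, genome_length_bp):
    """Convert a map-unit position to an approximate base-pair coordinate."""
    return round(mu / 100.0 * genome_length_bp)


if __name__ == "__main__":
    for name, size, start, end in PAN6_FRAGMENTS:
        print(name, round(implied_genome_length(size, start, end)))
```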

5. Introduction of a PKGFP selective marker into the final
construct for direct cloning of the gene of interest and green/white selection of
recombinant transformants.

A minigene cassette that expresses GFP under a lac promoter and
is flanked with recognition sites of the rare intron-encoded restriction enzymes PI-Sce I and
I-Ceu I was isolated from pShuttle-pkGFP (bare) by Sap I and Dra III digestions
followed by a filling-in reaction. The pShuttle-pkGFP (bare) plasmid is 4126 bp in length,
and contains a ColE1-Ori, a kanamycin resistance gene, plac, a LacZ promoter-GFPmut3-1
cds (Clontech), and a GFPmut3-1 cds (Clontech). This cassette is subcloned into Srf I
cut and blunted pXAdPan6-0-1, 9-100. This final construct is called pX-Pan6-pkGFP
mu.0-1, 9-100, which is useful for generating recombinant E1-deleted Pan 6 molecular
clones carrying genes of interest by direct ligation and green/white selection in
combination with the generic pShuttlepkGFP vectors.
B. Alternative Strategy for Generation of Pan-6 Plasmid

1. Cloning of 5' terminal fragment

The Pan 6 virus is deproteinated by pronase and
proteinase K treatment and phenol extraction as described above and synthetic 12 bp
Pme I linkers are ligated onto the viral DNA as described. The AdPan6 5' XbaI fragment
is isolated and ligated into pX to form pX-AdPan6-0-16.5 (9022 bp) as described in Part
A above.

2. Deletion of E1 from the 5' clone

To delete E1 (m.u. 1.2-9), pX-AdPan6-0-16.5 is digested
with SnaBI and NdeI to remove the regions encoding the E1a and E1b proteins
(3442-6310 bp). This vector is subsequently digested with BsiWI in preparation for blunting
with the minigene cassette carrying a selective marker.

3. Introduction of a selective marker

A minigene cassette that expresses GFP under a lac
promoter and which is flanked with recognition sites of the rare intron-encoded restriction
enzymes PI-SceI and I-CeuI was isolated from pShuttle-pkGFP as described above.
The DraIII-SapI fragment is then ligated with the digested pX-AdPan6-0-16.5 to form
pX-AdPan6 MU 0-16.5ΔE1 (7749 bp).

4. Extension of Pan-6 Adenoviral Sequences

pX-AdPan6 MU 0-16.5ΔE1 was subjected to XbaI
digestion to permit insertion of an XbaI-RsrII linker. An XbaI/RsrII digestion fragment
from the AdPan6 genome was isolated (mu 28-100, 26240 bp) and ligated into the
XbaI/RsrII-digested pX-AdPan6 MU 0-16.5ΔE1 to provide pX-AdPan6 MU 0-1, 9-16.5,
28-100. A second XbaI fragment from the Pan6 genome (mu 16.5-28, 4350 bp) is then
ligated into this plasmid to form pX-AdPan6 MU 0-1, 9-100 (38551 bp).

C. Generation of Recombinant Adenoviruses

To generate the recombinant adenoviruses from an E1-deleted Pan6
plasmid prepared as described in Part A or B, the plasmid is co-transfected with a helper
expressing E1, or is transfected into an E1-expressing packaging cell line, such as the
293 cell line or a cell line prepared as described herein. The expression of E1 in the
packaging cell permits the replication and packaging of the Pan6-pkGFP mu.0-1, 9-100
into a virion capsid. Alternatively, the packaging cell transfected with pX-Pan6-pkGFP
mu.0-1, 9-100 is transfected with an adenovirus vector as described above bearing
another transgene of interest.

Example 6 - Generation of Recombinant E1-Deleted Pan7 Vector

A. Generation of Pan7 Plasmids

A synthetic linker containing the restriction sites PacI-SmaI-FseI-MluI-EcoRV-PacI
was cloned into pBR322 that was cut with EcoRI and NdeI. The left
end (bp 1 to 3618) of Ad Pan7 was cloned into the linker between the SmaI and FseI sites.
The adenovirus E1 was then excised from the cloned left end by cutting with SnaBI and
NdeI and inserting an I-CeuI-GFP-PI-SceI cassette from pShuttle (Clontech) in its place.
The resulting plasmid was cut with FseI and MluI, and the Ad Pan7 fragment from FseI
(bp 3618) to MluI (bp 15114) was inserted to extend the left end. The construct
(pPan7pGFP) was completed by inserting the 21421 bp Ad Pan7 right end fragment from
the MluI site (bp 15114) into the above plasmid between MluI and EcoRV to generate a
complete molecular clone of E1-deleted adenovirus Pan7 suitable for the generation of
recombinant adenoviruses. Optionally, a desired transgene is inserted into the I-CeuI and
PI-SceI sites of the newly created pPan7 vector plasmid.

B. Construction of E1-Deleted Pan7 Viral Vectors

To generate the recombinant adenoviruses from pPan7ΔE1, the
plasmid is co-transfected with a helper expressing E1, or is transfected into an E1-expressing
packaging cell line, such as the 293 cell line or a cell line prepared as described herein. The
expression of E1 in the packaging cell permits the replication and packaging of the
Pan7ΔE1 into a virion capsid. In another embodiment, the packaging cell transfected
with pX'Pan7ΔE1 is transfected with an adenovirus vector as described above bearing
the transgene of interest. Homologous recombination occurs between the helper and the
plasmid, which permits the adenovirus-transgene sequences in the vector to be replicated
and packaged into virion capsids, resulting in the recombinant adenovirus. Transfection
and purification are as described above.

Example 7 - Generation of Plasmid Vectors Expressing the E1 Genes

Plasmid vectors are constructed which encode the Pan5 E1 region gene, and these
plasmids are used to generate stable cell lines expressing viral E1 proteins.
The E1 region of Pan5 is cloned into pX', essentially as described in Example 4
above, prior to replacement of this region with the fragment from pShuttle (Clontech).
The expression plasmid contains the Pan5 adenoviral genome sequence spanning at least
bp 1 to 3959 in the Pan5 genomic sequence. Thus, the expression plasmid contains the
sequence encoding E1a and E1b of chimpanzee Ad Pan5 under the control of a
heterologous promoter. Similar expression plasmids can be generated using the Ad Pan6
and Ad Pan7 E1 regions, identified in the tables above.

Example 8 - Generation of Cell Lines Expressing Chimpanzee Adenovirus E1 Proteins

Cell lines expressing viral E1 proteins are generated by transfecting HeLa (ATCC
Acc. No. CCL2) with the plasmid of Example 6. These cell lines are useful for the
production of E1-deleted recombinant chimpanzee adenoviruses by co-transfection of
genomic viral DNA and the expression plasmids described above. Transfection of these
cell lines, as well as isolation and purification of recombinant chimpanzee adenoviruses
therefrom, are performed by methods conventional for other adenoviruses, i.e., human
adenoviruses [see, e.g., Horwitz, cited above and other standard texts].

A. Cell lines expressing Pan5 E1 proteins

HeLa cells in 10 cm dishes are transfected with 10 μg of pX-Pan51-E1
DNA using a Cellphect™ kit (Pharmacia, Uppsala, Sweden) and following the
manufacturer's protocol. Twenty-two hours post-transfection, the cells are subjected to a
three minute glycerol shock (15% glycerol in Hepes Buffered Saline, pH 7.5), washed once in
DMEM (HeLa) or F12K (A549; Life Technologies, Inc., Grand Island, NY) media
supplemented with 10% FCS, 1% Pen-Strep, then incubated for six hours at 37°C in the
above described media. The transfected cells are then split into duplicate 15 cm plates at
ratios of 1:20, 1:40, 1:80, 1:160, and 1:320. Following incubation at 37°C overnight, the
media is supplemented with G418 (Life Technologies, Inc.) at a concentration of 1 μg/ml.
The media is replaced every 5 days and clones are isolated 20 days post-transfection.

HeLa E1 cell clones are isolated and assayed for their ability to augment
adeno-associated virus (AAV) infection and expression of recombinant LacZ protein as
described below.
B. AAV Augmentation Assay for Screening E1 Expressing Cell Lines

AAV requires adenovirus-encoded proteins in order to complete
its life cycle. The adenoviral E1 proteins as well as the E4 region-encoded ORF6 protein
are necessary for the augmentation of AAV infection. An assay for E1 expression based
on AAV augmentation is used. Briefly, the method for identifying adenoviral E1-expressing
cells comprises the steps of infecting, in separate cultures, a putative adenovirus
E1-expressing cell and a cell containing no adenovirus sequence with both an
adeno-associated virus (AAV) expressing a marker gene and an AAV expressing the ORF6 of
the E4 gene of human adenovirus, for a suitable time. The marker gene activity in the
resulting cells is measured and those cells with significantly greater measurable marker
activity than the control cells are selected as confirmed E1-expressing cells. In the
following experiment, the marker gene is a lacZ gene and the marker activity is the
appearance of blue stain.

For example, the cell lines described above, as well as
untransfected control cells (HeLa), are infected with 100 genomes per cell of an AAV
vector bearing a marker gene, e.g., AV.LacZ [K. Fisher et al., J. Virol., 70:520 (1996)]
and an AAV vector expressing the ORF6 region of human Ad5 (AV.orf6). The DNA
sequence of the plasmid generates a novel recombinant adeno-associated virus (rAAV)
containing the LacZ transgene and the Ad E4 ORF 6, which is an open reading frame
whose expression product facilitates single-stranded (ss) to double-stranded (ds)
conversion of rAAV genomic DNA. These vectors are incubated in medium containing
2% FCS and 1% Pen-Strep at 37°C for 4 hours, at which point an equal volume of
medium containing 10% FCS is added. It should be understood by one of skill in the art
that any marker gene (or reporter gene) may be employed in the first AAV vector of this
assay, e.g., alkaline phosphatase, luciferase, and others. An antibody-enzyme assay can
also be used to quantitate levels of antigen, where the marker expresses an antigen. The
assay is not limited by the identity of the marker gene. Twenty to twenty-four hours
post-infection, the cells are stained for LacZ activity using standard methods. After 4
hours the cells are observed microscopically and cell lines with significantly more blue
cells than the A549 or HeLa cell controls are scored as positive.
Example 9 - Delivery of Transgene to Host Cell

The resulting recombinant chimpanzee adenovirus described in Example 4, 5 or 6
above is then employed to deliver the transgene to a mammalian, preferably human, cell.
For example, following purification of the recombinant virus, human embryonic kidney
293 cells are infected at an MOI of 50 particles per cell. GFP expression was
documented 24 hours post-infection.

A. Gene Transfer in Mouse Models via Pan-6, Pan-7, and Pan-9 vectors

Gene transfer efficiencies and toxicological profiles of recombinant
chimpanzee adenoviruses were compared in mouse liver directed gene transfer, mouse
lung directed gene transfer, and mouse muscle directed gene transfer.

E1-deleted adenoviral vectors containing LacZ under the control of the
CMV promoter were constructed using the techniques herein for human Ad5, chimpanzee
Pan 6, chimpanzee Pan 7 and chimpanzee Pan 9 (C68). The vectors were delivered to
immune-deficient NCR nude mice (80 for each study) as follows. For the liver study, 100
μl (1 x 10^11 particles) were injected into the tail vein. For the lung study, 50 μl (5 x 10^10
particles) were delivered intratracheally. For the muscle study, 25 μl (5 x 10^10 particles)
were injected into the tibialis anterior. The mice were sacrificed on days 3, 7, 14 and 28
post-vector injection (5 animals per group at each time point). At each necropsy, the
liver/lung/muscle tissue was harvested and prepared for cryoblocks and paraffin
embedding. The cryoblocks were sectioned for X-gal staining and the paraffin sections
were H&E stained for histopathological analysis. At each time point, terminal bleeding was
performed. Serum samples were subjected to liver function tests.
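
The dosing figures above can be cross-checked with a short calculation. The sketch below derives the particle concentration implied by each dose volume and reproduces the figure of 80 mice per study, on the assumption (not stated explicitly in the text) that each of the four vectors forms one group at each of the four time points. All names are illustrative.

```python
# route: (injected volume in microlitres, particles delivered), from the text.
DOSES = {
    "liver": (100, 1e11),
    "lung": (50, 5e10),
    "muscle": (25, 5e10),
}


def particles_per_ml(volume_ul, particles):
    """Particle concentration (per ml) implied by a dose and its volume."""
    if volume_ul <= 0:
        raise ValueError("volume_ul must be positive")
    return particles * 1000.0 / volume_ul


def animals_per_study(vectors=4, time_points=4, animals_per_group=5):
    """Animals needed if each vector is one group at each time point (assumed)."""
    return vectors * time_points * animals_per_group


if __name__ == "__main__":
    for route, (volume_ul, particles) in DOSES.items():
        print(route, f"{particles_per_ml(volume_ul, particles):.1e} particles/ml")
    print("animals per study:", animals_per_study())
```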

It was observed in this experiment that the chimpanzee adenoviruses Pan-6, Pan-7,
and Pan-9 were less efficient than huAd5 in gene transfer to the liver and to the lung.
However, this may be desirable in certain circumstances, to reduce the liver toxicity observed
for huAd5. The gene transfer efficiency in muscle varied less between serotypes.

B. Mouse study of the feasibility of re-administration of adenovirus vectors by
serotype switching between Adhu5, Pan-6, and Pan-9 vectors

Mice (C57BL/6; 4/group) were administered LacZ vectors based on
huAd5, Pan-6, Pan-7, and Pan-9 (H5.040CMVLacZ, Pan6.000CMVLacZ,
Pan7.000CMVLacZ, Pan9.000CMVLacZ; 10^11 particles/injection) by tail vein. Thirty
days later the mice were re-administered adenovirus vectors expressing α1-antitrypsin
(H5.040CMVhA1AT, Pan6.000CMVhA1AT, 1 x 10^11 particles, Pan7.000CMVhA1AT,
Pan9.000CMVhA1AT, 10^11 particles/injection). Successful transduction by the
re-administered vector is monitored by measuring serum α1-antitrypsin on days 3 and 7
following re-administration.

The ability of adenovirus vectors based on huAd5, Pan-6, Pan-7, and Pan-9,
respectively, to transduce the livers of mice in the presence of neutralizing antibodies to
the other serotypes was determined. The results are tabulated here.



1st injection    2nd injection    Cross-neutralization
Adhu5            Adhu5            Yes (+ve control)
                 Pan-6            No
                 Pan-7            No
                 Pan-9 (C68)      No
Pan-6            Adhu5            No
                 Pan-6            Yes (+ve control)
                 Pan-7            Yes
                 Pan-9 (C68)      No
Pan-7            Adhu5            No
                 Pan-6            Yes
                 Pan-7            Yes (+ve control)
                 Pan-9 (C68)      Yes
Pan-9 (C68)      Adhu5            No
                 Pan-6            No
                 Pan-7            Yes
                 Pan-9 (C68)      Yes (+ve control)

Ability of vectors to transduce murine liver in the presence of neutralizing
antibodies to other serotypes.

Thus, immunization with huAd5 does not prevent re-administration with any of
the chimpanzee adenovirus vectors Pan-6, Pan-7, or Pan-9 (C68). This experiment also
appears to indicate that Pan-7 is between Pan-6 and Pan-9 in the spectrum of antigenic
relatedness and cross-reacts with both; however, Pan-6 and Pan-9 do not neutralize each
other. This is a surprising result based on homology comparisons, which indicate that
Pan-6 is quite distinct from Pan-7 and Pan-9. Evaluation of antisera generated against
Pan-9 indicated no cross-neutralization against Pan-6 but some neutralization against
Pan-7, arguing that Pan-6 is distinct from the others.
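
The re-administration results lend themselves to a simple lookup of which serotypes remain usable for a second injection. The sketch below encodes the Yes/No entries of the cross-neutralization table as a matrix; the data structure and function names are illustrative assumptions.

```python
SEROTYPES = ["Adhu5", "Pan-6", "Pan-7", "Pan-9"]

# CROSS[first][second] is True where the table reports cross-neutralization
# of the second injection after priming with the first.
CROSS = {
    "Adhu5": {"Adhu5": True, "Pan-6": False, "Pan-7": False, "Pan-9": False},
    "Pan-6": {"Adhu5": False, "Pan-6": True, "Pan-7": True, "Pan-9": False},
    "Pan-7": {"Adhu5": False, "Pan-6": True, "Pan-7": True, "Pan-9": True},
    "Pan-9": {"Adhu5": False, "Pan-6": False, "Pan-7": True, "Pan-9": True},
}


def boost_candidates(prime):
    """Serotypes not neutralized after a first injection of `prime`."""
    return [s for s in SEROTYPES if not CROSS[prime][s]]


def is_symmetric(matrix):
    """True if cross-neutralization is reported the same in both directions."""
    return all(matrix[a][b] == matrix[b][a] for a in matrix for b in matrix)


if __name__ == "__main__":
    for prime in SEROTYPES:
        print(prime, "->", boost_candidates(prime))
    print("symmetric:", is_symmetric(CROSS))
```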

Example 10 - Generation of Recombinant E1-Deleted SV-25 Vector

A plasmid was constructed containing the complete SV-25 genome except for an
engineered E1 deletion. At the site of the E1 deletion, recognition sites for the restriction
enzymes I-CeuI and PI-SceI were inserted; these allow insertion of a transgene from a shuttle
plasmid in which the transgene expression cassette is flanked by these two enzyme
recognition sites.

A synthetic linker containing the restriction sites SwaI-SnaBI-SpeI-AflII-EcoRV-SwaI
was cloned into pBR322 that had been cut with EcoRI and NdeI. This was done by
annealing together two synthetic oligomers, SV25T (5'-AAT TTA AAT ACG TAG CGC
ACT AGT CGC GCT AAG CGC GGA TAT CAT TTA AA-3', SEQ ID NO: 38) and
SV25B (5'-TAT TTA AAT GAT ATC CGC GCT TAA GCG CGA CTA GTG CGC
TAC GTA TTT A-3', SEQ ID NO:39), and inserting the annealed product into the
digested pBR322. The left end (bp 1 to 1057, SEQ ID NO:29) of Ad SV25 was cloned into the
above linker between the SnaBI and SpeI sites. The right end (bp 28059 to 31042, SEQ
ID NO: 29) of Ad SV25 was cloned into the linker between the AflII and EcoRV sites.
The adenovirus E1 was then excised between the EcoRI site (bp 547) and the XhoI site (bp 2031)
from the cloned left end as follows. A PCR-generated I-CeuI-PI-SceI cassette from
pShuttle (Clontech) was inserted between the EcoRI and SpeI sites. The 10154 bp XhoI
fragment of Ad SV-25 (bp 2031 to 12185, SEQ ID NO:29) was then inserted into the SpeI
site. The resulting plasmid was digested with HindIII and the construct (pSV25) was
completed by inserting the 18344 bp Ad SV-25 HindIII fragment (bp 11984 to 30328,
SEQ ID NO:29) to generate a complete molecular clone of E1-deleted adenovirus SV25
suitable for the generation of recombinant adenoviruses. Optionally, a desired transgene
is inserted into the I-CeuI and PI-SceI sites of the newly created pSV25 vector plasmid.
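
The base-pair coordinates quoted in this cloning scheme can be cross-checked with simple arithmetic. The following Python sketch is illustrative only: the function and variable names are hypothetical, and it assumes that a fragment size is the difference between its two SEQ ID NO:29 coordinates, which appears to be how the quoted sizes were derived.

```python
# Coordinates and sizes as quoted in Example 10 (SEQ ID NO:29 numbering).
# Names are illustrative; the length convention (end - start) is an assumption.
FRAGMENTS = {
    "XhoI fragment": (2031, 12185, 10154),
    "HindIII fragment": (11984, 30328, 18344),
}
E1_EXCISION = (547, 2031)  # EcoRI site to XhoI site


def span(start: int, end: int) -> int:
    """Distance in bp between two coordinates."""
    return end - start


def sizes_consistent(fragments: dict) -> dict:
    """Map each fragment name to True if its quoted size equals end - start."""
    return {name: span(s, e) == size for name, (s, e, size) in fragments.items()}


def overlap(a: tuple, b: tuple) -> int:
    """Number of bp shared by two (start, end) intervals, 0 if disjoint."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))


if __name__ == "__main__":
    print(sizes_consistent(FRAGMENTS))
    print("E1 excision (bp):", span(*E1_EXCISION))
    xho = FRAGMENTS["XhoI fragment"][:2]
    hind = FRAGMENTS["HindIII fragment"][:2]
    print("XhoI/HindIII fragment overlap (bp):", overlap(xho, hind))
```

Under these assumptions both quoted fragment sizes agree with their coordinates, and the XhoI and HindIII fragments share the region between bp 11984 and bp 12185.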
To generate an AdSV25 carrying a marker gene, a GFP (green fluorescent
protein) expression cassette previously cloned in the plasmid pShuttle (Clontech) was
excised with the restriction enzymes I-CeuI and PI-SceI and ligated into pSV25 (or
another of the Ad chimp plasmids described herein) digested with the same enzymes.
The resulting plasmid (pSV25GFP) was digested with SwaI to separate the bacterial
plasmid backbone and transfected into the E1-complementing cell line HEK 293. About
10 days later, a cytopathic effect was observed, indicating the presence of replicative
virus. The successful generation of an Ad SV25-based adenoviral vector expressing GFP
was confirmed by applying the supernatant from the transfected culture to fresh cell
cultures. The presence of secondarily infected cells was determined by observation of
green fluorescence in a population of the cells.

Example 11 - Construction of E3-deleted Pan-5, Pan-6, Pan-7 and C68 vectors

In order to enhance the cloning capacity of the adenoviral vectors, the E3 region
can be deleted because this region encodes genes that are not required for the propagation
of the virus in culture. Towards this end, E3-deleted versions of Pan-5, Pan-6, Pan-7, and
C68 have been made (a 3.5 kb Nru-AvrII fragment containing E3 ORF1-9 is deleted).

A. E3 deleted Pan5 based vector

E1-deleted pPan5-pkGFP plasmid was treated with Avr II endonuclease to
isolate a 5.8 kb fragment containing the E3 region, and the remaining pPan5-pkGFP
backbone carrying the Avr II deletion was recircularized to form construct
pPan5-pkGFP-E3-Avr II. Subsequently, the 5.8 kb Avr II fragment was subcloned into
pSL-Pan5-E3-Avr II for a further deletion of the E3 region by Nru I digestion. This led
to a plasmid pSL-Pan5-E3-deletion. The final construct pPan5-E3-pkGFP was produced by
removing a 4.3 kb Avr II/Spe I fragment from the pSL-Pan5-E3-deletion plasmid and
inserting it into pPan5-pkGFP-E3-Avr II at the Avr II site. In this final
construct, a 3.1 kb deletion in the E3 region was accomplished.
B. E3 deletion in Pan6 based vector

E1-deleted pPan6-pkGFP molecular clone was digested with Sbf I and
Not I to isolate a 19.3 kb fragment, which was ligated back at the Sbf I site. The resulting
construct pPan6-Sbf I-E3 was treated with Eco 47 III and Swa I, generating pPan6-E3.
Finally, a 21 kb Sbf I fragment from Sbf I digestion of pPan6-pkGFP was subcloned into
pPan6-E3 to create pPan6-E3-pkGFP with a 4 kb deletion in E3.

C. E3 deleted Pan7 and Pan9 vectors

The same strategy was used to achieve E3 deletions in both vectors. First,
a 5.8 kb Avr II fragment spanning the E3 region was subcloned into pSL-1180, followed by
deletion of E3 by Nru I digestion. The resulting plasmids were treated with Spe I and Avr
II to obtain 4.4 kb fragments, which were cloned into pPan7-pkGFP and pPan9-pkGFP at the
Avr II sites to replace the original E3-containing Avr II fragments, respectively. The final
pPan7-E3-pkGFP and pPan9-E3-pkGFP constructs have 3.5 kb E3 deletions.

Example 12 - Construction of E3- and E4-deleted Pan-7 vector

Although the deletion of the E1 region of adenoviruses (first generation
adenovirus vectors) renders them replication-incompetent, expression of the adenoviral
vector backbone genes is not fully abolished. Deletion of the E4 region considerably
attenuates this residual gene expression and may confer a safety advantage. An E4-deleted
Pan-7 vector containing a 2.5 kb deletion (a PvuII-AgeI fragment containing
E4 ORF1-ORF7 is deleted) has been constructed. High titer stocks of this virus were
generated using a HEK 293-based cell line which, in addition to E1, expresses an
essential E4 gene (orf 6).

1. E4 deletion in the molecular clone of Pan7

A 19 kb Xba I fragment was deleted from pPan7-pkGFP to create pPan7-Xba I,
from which a 2.5 kb E4 fragment was deleted by Age I and Pvu II partial digestion,
resulting in pPan7-Xba I-E4. The pPan7-E4-pkGFP plasmid was generated from pPan7-Xba
I-E4 in two sequential cloning steps, adding the 19 kb Xba I and 15 kb I-Ceu I/Mlu
I fragments, both of which came from the pPan7-pkGFP construct.

2. Introduction of E3 and E4 deletions in Pan9 vector

An 11 kb plasmid, pPan9-EcoRI, containing the E4 region was created by
retrieving an 11 kb EcoRI fragment from pPan9-pkGFP after EcoRI digestion and
self-ligation. The E4 region was deleted from this construct by Age I digestion and
fill-in, Pvu II partial digestion, and self-ligation to generate pPan9-EcoR I-E4. A 23 kb
EcoR I fragment was isolated from pPan9-pkGFP and inserted into pPan9-EcoR
I-E4 at the EcoR I site, followed by addition of a 5.8 kb Avr II fragment from
pPan9-pkGFP, to form the final product pPan9-E3-E4-pkGFP. Compared to the
genome size of wild type Pan9, this E1-E3-E4-deleted vector could have a
transgene capacity of up to 8 kb.

3. Introduction of minigene cassettes with genes of interest, including
reporter genes and the glyco- and nuclear proteins of Ebo, into molecular clones of Pan
vectors

A highly efficient direct cloning and green/white selection procedure was
employed for creating molecular clones of recombinant viruses. Briefly, genes of interest
were cloned into pShuttlepkGFP by screening white colonies for recombinants.
Subsequently, the minigene cassettes were transferred into chimpanzee
adenovirus backbone plasmids (pPanX-pkGFP with various deletions) by
swapping with the pkGFP cassette at the I-Ceu I and PI-Sce I sites and screening a
few white colonies for correct recombinants.

4. Rescue of molecular clones of Pan vectors with multiple deletions in
early regions and virus propagation

For rescue of E1-E3-deleted molecular clones of chimpanzee adenovirus
vectors, the clones were linearized with appropriate restriction enzymes and transfected
into regular 293 cells. Once a full cytopathic effect (CPE) was observed in the transfected
cells, crude lysate was harvested and expanded in 293 cells to large-scale infections. The
viruses were purified by the CsCl sedimentation method.

For E1-E4- and E1-E3-E4-deleted Pan vectors, 10-3 cells, a 293-based
E1-E4-complementing cell line, were used for rescue and propagation of vectors. E4
ORF6 gene expression in 10-3 cells was induced by addition of 150 µM ZnSO4 to the
culture medium.

Example 13 - Vaccination with adenovirus vectors expressing wild type and variant 
EboZ GP. 

AdHu5 or AdC7 vectors expressing Ebola envelope chimeras were
produced for in vivo immunization experiments in C57BL/6 mice. Recombinant
viruses with different viral backbones were created by a molecular cloning method
in which the minigene cassettes were inserted in place of the E1 deletions. The
molecular clones of all recombinant viruses were rescued and grown up in 293
cells for large-scale purification using the CsCl sedimentation method.

Five EboZ variants encoded by AdHu5 or AdPan7 (C7) were selected and
produced to evaluate their relative immunogenicity following an intramuscular
Ad injection. The wt Ebo, a soluble Ebo variant, EboA1, EboA2, EboA3, EboA4,
EboA5S, EboA6S, EboA7S and EboA8S were evaluated in the initial vaccine
studies. For the data summarized in the following table, the number of viral
particles (per ml or total) produced and amplified from infected 293 cells was
established by spectrophotometric reading.

Table: Production of AdHu5 or AdC7 adenoviral vectors encoding EboZ variants.

          HuAd5                               AdC7
Gene      Titer              Total yield      Titer              Total yield
          (VP x 10^12/ml)    (VP x 10^12)     (VP x 10^12/ml)    (VP x 10^12)
Ebo wt    2.6                12               4.3                43
EboS      4.9                49               4.6                55
EboA2     2.1                9                5.8                93
EboA3     1.7                8                5.3                95
EboA4     3                  12               4.1                62


Vector was administered intramuscularly (10^11 genome copies/cell) in C57BL/6
mice and the presence of virus neutralizing antibody (VNA) was evaluated 28 days later
as a first measure of an immune response generated against the Ebola envelope
glycoprotein. VNA is defined here as serum antibody able to inhibit transduction of HeLa
cells mediated by an HIV-based vector pseudotyped with the wild-type Ebola envelope.

VNA to the EboZ pseudotypes was detected, with AdPan7 (C7) yielding higher
titers than AdHu5. EboZA3 elicited the highest VNA among the transgene
targets. For the data summarized in the following table, neutralizing antibody titers to
HIV-EboZ-GFP pseudotypes (reciprocal dilution) are provided (N=5 animals/group).

VNA Titers

         EboZ wildtype   EboZs   EboZA3
AdHu5    12              16      12
AdC7     44              12      140
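
The relative performance of the two vector backbones in the VNA table can be expressed as a fold difference per transgene. The Python sketch below is illustrative only; the dictionary layout and function name are hypothetical, and the values are simply the reciprocal-dilution titers copied from the table.

```python
# Reciprocal-dilution VNA titers copied from the table (N=5 animals/group).
# The data structure and function name are illustrative, not from the source.
VNA_TITERS = {
    "AdHu5": {"EboZ wildtype": 12, "EboZs": 16, "EboZA3": 12},
    "AdC7": {"EboZ wildtype": 44, "EboZs": 12, "EboZA3": 140},
}


def fold_difference(titers, numerator="AdC7", denominator="AdHu5"):
    """Return numerator/denominator titer ratio for each transgene."""
    return {
        transgene: titers[numerator][transgene] / titers[denominator][transgene]
        for transgene in titers[numerator]
    }


if __name__ == "__main__":
    for transgene, ratio in fold_difference(VNA_TITERS).items():
        print(f"{transgene}: AdC7/AdHu5 = {ratio:.2f}")
```

A ratio above 1 means the AdC7 vector gave the higher titer for that transgene; on the tabulated values this holds for EboZ wildtype and EboZA3 but not for EboZs.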
Example 14 - Pan7-mediated Expression of Ebola Proteins

Mouse studies to evaluate Pan-7 vectors expressing Ebola envelope proteins and
the Ebola nuclear antigen have been initiated. These are directed towards evaluation of
neutralizing antibodies in C57BL/6 mice injected intramuscularly (IM) with AdHu5 or
Pan-7 expressing each of 4 Ebola env constructs.

A. Evaluation of CTL from C57BL/6 mice injected IM with AdHu5 or Pan-7
expressing the Ebola env constructs.

1. Challenge experiment in mice with Ebola virus.

Neutralizing antibody (NAB) responses to the Ebola envelope
were analyzed by measuring the ability of sera from immunized mice to neutralize a lentiviral
(HIV) vector pseudotyped with the several constructs (eEbo, NTD2, NTD3, NTD4) of the
Ebola envelope glycoprotein. C57BL/6 or BALB/c mice received a single intramuscular
injection of 5 x 10^10 particles per mouse of C7 (Ad Pan-7) encoding an Ebola envelope
variant. Neutralizing antibody was evaluated 30 days post-vaccination. Briefly, Ebola
Zaire pseudotyped HIV vector encoding β-galactosidase (EboZ-HIV-LacZ) was
incubated for 2 hr at 37°C with different dilutions of heat-inactivated mouse serum.
Following the incubation with serum, EboZ-HIV-LacZ was used to infect HeLa cells
for 16 hr at 37°C. Infectivity was revealed by X-gal staining of transduced HeLa cells
positive for β-galactosidase. The neutralizing titer represents the reciprocal serum dilution
at which a 50% decrease in the number of β-galactosidase-positive blue cells was observed.
Sera were collected 30 days post-immunization, which consisted of a single intramuscular
(I.M.) administration of 5 x 10^10 particles/animal. Neutralizing antibody to Ebola
pseudotyped HIV could be detected in all groups, with antibody titers ranging from 20
for Ad-EboZ (AdHu5 expressing EboZ), Ad-NTD3 (AdHu5 expressing NTD3) and C7-sEbo
(Ad Pan-7 expressing soluble EboZ) to over 130 for C7-NTD3 (Ad Pan-7
expressing soluble NTD3) and C7-NTD4 (Ad Pan-7 expressing soluble NTD4). The same
immunization strategy in BALB/c mice resulted in lower neutralizing antibody titers for
Ad- and C7-NTD2, and NTD4.
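
The titer definition used here (the reciprocal dilution at which blue-cell counts fall by 50%) can be sketched as a small calculation. The Python example below is a minimal illustration under stated assumptions: the function name, the endpoint rule (highest dilution still giving at least 50% reduction relative to a no-serum control) and the example counts are all hypothetical and are not taken from the study.

```python
# Endpoint-style 50% neutralization titer; a sketch, not the authors' procedure.
def neutralizing_titer(reciprocal_dilutions, blue_cell_counts, no_serum_count, cutoff=0.5):
    """Return the highest reciprocal dilution whose blue-cell count is reduced
    by at least `cutoff` relative to the no-serum control, or None if no
    dilution reaches the cutoff."""
    if no_serum_count <= 0:
        raise ValueError("no_serum_count must be positive")
    titer = None
    # Walk from the most concentrated serum to the most dilute.
    for dilution, count in sorted(zip(reciprocal_dilutions, blue_cell_counts)):
        reduction = 1.0 - count / no_serum_count
        if reduction >= cutoff:
            titer = dilution
    return titer


if __name__ == "__main__":
    # Hypothetical counts for a two-fold dilution series (not data from the source).
    dilutions = [10, 20, 40, 80, 160]
    counts = [5, 20, 45, 70, 95]
    print(neutralizing_titer(dilutions, counts, no_serum_count=100))
```

An interpolated titer (for example, log-linear interpolation between the two dilutions that bracket 50%) would be a reasonable alternative; the endpoint rule is used here only because it is the simplest reading of the definition.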

B. Cellular Immune Response 

The cellular immune response to the Ebola envelope in C57BL/6 mice
was evaluated 8 days after a single I.M. administration of 5 x 10^10 particles of C7-LacZ or
C7-Ebola envelope variant per animal. Splenic lymphocytes from immunized
mice were collected 8 days post vaccination and stimulated in vitro with feeder cells
(splenic lymphocytes from untreated mice infected with human Adenovirus serotype 5
encoding the wild-type Ebola envelope and irradiated). Standard 5-hr CTL assays
were performed using 51Cr-labeled syngeneic C57 cells transfected with an expressor of
EboZ.

A positive MHC-restricted cytotoxic T lymphocyte (CTL) response was observed
for all AdPan-7 vectors encoding Ebola envelope variants, with a higher response in
NTD2-, NTD3- or NTD4-immunized mice. Indeed, effector cells from mice immunized with
C7 encoding an Ebola envelope variant recognized EboZ-transfected target cells and gave
recall CTL responses of up to 30% specific lysis. Less than 5% lysis was seen with effector
cells from naive or LacZ-immunized control mice, confirming that lysis was specific for
Ebola envelope antigens.
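
The "% specific lysis" figures quoted for the 51Cr-release assay are conventionally derived from three readings per well. The source does not state the formula used, so the Python sketch below assumes the conventional calculation, 100 x (experimental - spontaneous) / (maximum - spontaneous); the function name and the example counts are hypothetical.

```python
# Conventional chromium-release calculation (assumed; not stated in the source).
def percent_specific_lysis(experimental_cpm, spontaneous_cpm, maximum_cpm):
    """Percent specific lysis from experimental, spontaneous and maximum
    51Cr release, all in counts per minute."""
    if maximum_cpm <= spontaneous_cpm:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental_cpm - spontaneous_cpm) / (maximum_cpm - spontaneous_cpm)


if __name__ == "__main__":
    # Hypothetical counts chosen for illustration only.
    print(percent_specific_lysis(experimental_cpm=1300, spontaneous_cpm=400, maximum_cpm=3400))
```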

C. Protection Studies 

The most direct means of evaluating C7 (Ad Pan-7) encoding the
EboZ variants as a successful vaccine in mice was to assess protection against weight loss
and death following lethal challenge with mouse-adapted Ebola Zaire virus. BALB/c mice
were immunized with a single dose of 5 x 10^10 particles per animal as performed
previously, and vaccinated animals were challenged with 200 LD50 of mouse-adapted
Ebola Zaire 21 days later. All control mice (vehicle and C7-LacZ) died between day 5 and
day 9 post-challenge. In contrast, all vaccinated mice but one (from the C7-sEbo group)
survived the challenge with Ebola Zaire.

Weight loss was observed in mice vaccinated with C7-sEbo from day 4 to day
7. Signs of illness such as pilo-erection and light to severe lethargy were also noted
in mice vaccinated with C7-sEbo, NTD2 and NTD3 between day 4 and day 7. Mice
immunized with C7-EboZ and C7-NTD4 did not show signs of illness. Overall, a single
dose of C7-EboZ and C7-NTD4 completely protected immunized mice from illness and
death, possibly due to significant T cell mediated immunity.






WO 03/046124 



PCT/US02/33645 



All documents recited above are incorporated herein by reference. Numerous 
modifications and variations of the present invention are included in the scope of the 
above-identified specification and are expected to be obvious to one of skill in the art. 
Such modifications and alterations to the compositions and processes of the present
invention, such as selections of different minigenes or selection or dosage of the
vectors or immune modulators are believed to be within the scope of the claims
appended hereto.

WHAT IS CLAIMED IS:

1. An isolated simian adenovirus nucleic acid sequence selected from the
group consisting of:

(a) Pan5 having the sequence of nucleic acids 1 to 36462 of SEQ ID NO:1;

(b) Pan6 having the sequence of nucleic acids 1 to 36604 of SEQ ID NO:5;

(c) Pan7 having the sequence of nucleic acids 1 to 36535 of SEQ ID NO:9;

(d) SV1 having the sequence of nucleic acids 1 to 34264 of SEQ ID NO:24;

(e) SV25 having the sequence of nucleic acids 1 to 31044 of SEQ ID NO:29;

(f) SV39 having the sequence of nucleic acids 1 to 34115 of SEQ ID NO:34, and

(g) a nucleic acid sequence complementary to the sequence of any of (a)
to (f).

2. An isolated simian adenovirus serotype nucleic acid sequence selected
from one or more of the group consisting of:

(a) 5' inverted terminal repeat (ITR) sequences;

(b) the adenovirus E1a region, or a fragment thereof selected
from among the 13S, 12S and 9S regions;

(c) the adenovirus E1b region, or a fragment thereof selected
from among the group consisting of the small T, large T, IX, and IVa2 regions;

(d) the E2b region;

(e) the L1 region, or a fragment thereof selected from among
the group consisting of the 28.1 kD protein, polymerase, agnoprotein, 52/55 kD
protein, and IIIa protein;

(f) the L2 region, or a fragment thereof selected from the group
consisting of the penton, VII, VI, and Mu proteins;

(g) the L3 region, or a fragment thereof selected from the group
consisting of the VI, hexon, or endoprotease;

(h) the 2a protein;

(i) the L4 region, or a fragment thereof selected from the group
consisting of the 100 kD protein, the 33 kD homolog, and VIII;

(j) the E3 region, or a fragment thereof selected from the group
consisting of E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORF5, E3 ORF6, E3
ORF7, E3 ORF8, and E3 ORF9;

(k) the L5 region, or a fragment thereof selected from a fiber
protein;

(l) the E4 region, or a fragment thereof selected from the group
consisting of E4 ORF7, E4 ORF6, E4 ORF4, E4 ORF3, E4 ORF2, and E4 ORF1; and

(m) the 3' ITR,

of any of Pan5 SEQ ID NO:1; Pan6 SEQ ID NO:5; Pan7 SEQ ID NO:9;
SV1 SEQ ID NO:24; SV25 SEQ ID NO:29; and SV39 SEQ ID NO:34, or sequence
complementary to any of (a) to (m).

3. A simian adenovirus protein encoded by the nucleic acid sequence 
according to claim 2. 

4. A nucleic acid molecule comprising a heterologous simian adenoviral 
sequence according to claim 2. 

5. The nucleic acid molecule according to claim 4, wherein said simian
adenoviral sequence encodes an adenoviral gene product and is operatively linked to
regulatory control sequences which direct expression of the adenoviral gene product in a
host cell.

6. The nucleic acid molecule according to claim 4 or 5, wherein said simian
adenoviral sequence comprises the E1a region of Pan5 SEQ ID NO:1; Pan6 SEQ ID
NO:5; Pan7 SEQ ID NO:9; SV1 SEQ ID NO:24; SV25 SEQ ID NO:29; and SV39 SEQ
ID NO:34.

7. A pharmaceutical composition comprising the nucleic acid molecule 
according to claim 6 and a physiologically compatible carrier. 

8. An isolated simian adenoviral capsid protein selected from the group
consisting of:

(a) a hexon protein of Pan5 SEQ ID NO:3, Pan6 SEQ ID NO:7,
Pan7 SEQ ID NO:11, SV1 SEQ ID NO:26, SV25 SEQ ID NO:31 or SV39 SEQ ID
NO:36, or fragment thereof;

(b) a penton protein of Pan5 SEQ ID NO:2, Pan6 SEQ ID
NO:6, Pan7 SEQ ID NO:10, SV1 SEQ ID NO:25, SV25 SEQ ID NO:30 or SV39
SEQ ID NO:35;

(c) a fiber protein of Pan5 SEQ ID NO:4, Pan6 SEQ ID NO:8,
Pan7 SEQ ID NO:12, SV1 SEQ ID NO:27 and SEQ ID NO:28, SV25 SEQ ID NO:32
and SEQ ID NO:33 or SV39 SEQ ID NO:37, or a fragment thereof.

9. An artificial adenovirus serotype comprising a capsid protein 
according to claim 8 or a fragment thereof. 

10. The artificial adenovirus serotype according to claim 9, wherein said 
capsid comprises a fragment of the hexon protein selected from the group consisting 
of Pan5 SEQ ID NO: 15, Pan6 SEQ ID NO:16 and Pan7 SEQ ID NO:17. 

11. The artificial adenovirus serotype according to claim 9 or 10, wherein
said capsid comprises a fragment of the fiber protein selected from the group
consisting of Pan6 SEQ ID NO:19, Pan7 SEQ ID NO:20 and Pan5 SEQ ID NO:21.

12. A nucleic acid molecule comprising a heterologous sequence encoding
a protein according to claims 3 or 8 or an artificial adenovirus serotype according to
any of claims 9 to 11.

13. A recombinant vector comprising a simian adenovirus sequence 
according to claim 2 or a nucleic acid molecule according to claim 4 or 12 and a 
heterologous gene operatively linked to sequences which direct expression of said 
gene in a host cell. 

14. The recombinant vector according to claim 13, further comprising 5'
and 3' adenovirus cis-elements necessary for replication and encapsidation.

15. The recombinant vector according to claim 13 or claim 14, wherein 
said vector is a virus. 

16. The recombinant vector according to any of claims 13 to 15 wherein 
said vector lacks all or a part of the El gene. 

17. A recombinant virus comprising a simian capsid protein according to 
claim 3 or an artificial adenovirus serotype according to any of claims 9 to 11. 

18. A host cell comprising a nucleic acid molecule according to any of
claims 4 to 6 or 12, a recombinant vector according to any of claims 13 to 16, or a
recombinant virus according to claim 17.

19. The host cell according to claim 18, wherein said host cell is stably
transformed with the nucleic acid molecule or the recombinant vector.

20. The host cell according to claim 18 or claim 19, wherein said host cell
expresses one or more adenoviral gene products from said nucleic acid molecule or
recombinant vector, said adenoviral gene products selected from the group consisting
of E1a, E1b, E2a, and E4 ORF6.

21. The host cell according to any of claims 18 to 20, wherein said host
cell is stably transformed with a nucleic acid molecule comprising the simian
adenovirus inverted terminal repeats.

22. A composition comprising a recombinant vector in a pharmaceutically
acceptable carrier, said vector comprising a simian adenovirus sequence according to
any of claims 1 or 2 and a selected heterologous gene operatively linked to regulatory
sequences which direct expression of said gene in a host cell.

23. A method for delivering a heterologous gene to a mammalian cell 
comprising introducing into said cell an effective amount of the vector of any of 
claims 13 to 16 or a virus according to claim 17. 

24. A method for repeat administration of a heterologous gene to a 
mammal comprising the steps of: 

(a) introducing into said mammal a first vector which 
comprises the heterologous gene and 

(b) introducing into said mammal a second vector which 
comprises the heterologous gene; 

wherein at least the first virus or the second vector is a virus according 
to claim 17 and wherein the first and second recombinant vector are different. 

25. A method for producing a selected gene product comprising infecting a
mammalian cell with the vector of any of claims 13 to 16 or a virus according to
claim 17, culturing said cell under suitable conditions and recovering from said cell
culture the expressed gene product.

26. A method for eliciting an immune response in a mammalian host
against an infective agent comprising administering to said host an effective amount
of the recombinant adenovirus of claim 17, wherein said heterologous gene encodes
an antigen of the infective agent.

27. The method according to claim 26, comprising the step of priming the
host with a DNA vaccine comprising the heterologous gene prior to administering the
recombinant adenovirus.

28. A composition comprising a simian adenovirus capsid protein 
according to claim 8 linked to a heterologous molecule for delivery to a selected host 
cell. 

29. A method for targeting a cell having an adenoviral receptor comprising 
delivering to a subject a composition according to claim 28. 
SEQUENCE LISTING

<110> The Trustees of the University of Pennsylvania
Wilson, James M.
Gao, Guangping
Roy, Soumitra

<120> Simian Adenovirus Nucleic Acid and Amino Acid Sequences,
Vectors Containing Same, and Methods of Use

<130> UPN-02 677PCT

<150> US 60/331,951
<151> 2001-11-21

<150> US 60/366,798
<151> 2002-03-22

<160> 39

<170> PatentIn version 3.1

<210> 1
<211> 36462
<212> DNA
<213> chimpanzee adenovirus serotype Pan5

<220>
<221> CDS
<222> (13898)..(15490)
<223> L2 Penton

<220>
<221> CDS
<222> (18315)..(21116)
<223> L3 Hexon

<220>
<221> CDS
<222> (32035)..(33372)
<223> L5 Fiber

<400> 1



catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg aggtatttga 60

atttggggat gcggggcggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120

tgacgttttg atgacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300

aatgaggaag tgaaaatctg agtaattccg cgtttatggc agggaggagt atttgccgag 360

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggt gtcagctgat cgccagggta 480

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaact ggtggtggac gccatgatgg 660 

gtgacgaccc tccggagccc cctaccccat ttgaagcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgca gcgaaccagg gagtgaaaac agcgagcgag ggctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 

acagtaagtg tgattaactt tagctgggga ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgtttttta tgtgtaggtc ccgtctctga cgtagatgag acccccacta 1260 

cagagtgcat ttcatcaccc ccagaaattg gcgaggaacc gcccgaagat attattcata 1320 

gaccagttgc agtgagagtc accgggcgta gagcagctgt ggagagtttg gatgacttgc 1380 

tacagggtgg ggatgaacct ttggacttgt gtacccggaa acgccccagg cactaagtgc 1440 

cacacatgtg tgtttactta aggtgatgtc agtatttata gggtgtggag tgcaataaaa 1500 

tccgtgttga ctttaagtgc gtggtttatg actcaggggt ggggactgtg ggtatataag 1560 

caggtgcaga cctgtgtggt cagttcagag caggactcat ggagatctgg acagtcttgg 1620 

aagactttca ccagactaga cagctgctag agaactcatc ggagggagtc tcttacctgt 1680 

ggagattctg cttcggtggg cctctagcta agctagtcta tagggccaag caggattata 1740 

aggatcaatt tgaggatatt ttgagagagt gtcctggtat ttttgactct ctcaacttgg 1800 

gccatcagtc tcactttaac cagagtattc tgagagccct tgacttttct actcctggca 1860 

gaactaccgc cgcggtagcc ttttttgcct ttatccttga caaatggagt caagaaaccc 1920 

atttcagcag ggattaccgt ctggactgct tagcagtagc tttgtggaga acatggaggt 1980 

gccagcgcct gaatgcaatc tccggctact tgccagtaca gccggtagac acgctgagga 2040 

tcctgagtct ccagtcaccc caggaacacc aacgccgcca gcagccgcag caggagcagc 2100 
gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760

tgctcgatct cctcctcctg aaggtctccg cgaccggcgc gctcgacggt ggccgcgagg 8820

tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880

cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940
agctcgacgt ga ccrccrt cfaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000

gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060

ctgacgtcgc ccagggcttc caagcgctcc atggcctcgt agaagtccac ggcgaagttg 9120

aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180

gccxatacrtgcr cgcgcacctc gcgctcgaag gc cccggg crcf gctcctcttc ttccatctcc 9240

tcctcctctt ccatctcctc cactaacatc tcttctactt cctcctcagg aggcggcggc 9300

ccctgcgtcg ccggcggcgc acgggcagac ggtcgatgaa gcgctcgatg 9360

gtctccccgc gccggcgacg catggtctcg gtgacggcgc gcccgtcctc gcggggccgc 9420

agcgtgaaga cgccgccgcg catctccagg tggccgccgg gggggtctcc gttgggcagg 9480

gagagggcgc tgacgatgca tcttatcaat tggcccgtag ggactccgcg caaggacctg 9540

agcgtctcga gatccacggg atccgaaaac cgctgaacga aggcttcgag ccagtcgcag 9600

tcgcaaggta ggctgagccc ggtttcttgt tcttcgggta tttggtcggg aggcgggcgg 9660

gcgatgctgc tggtgatgaa gttgaagtag gcggtcctga aacaacaaat ggtggcgagg 9720

agcaccaggt ccttgggccc ggcttgctgg atgcgcagac ggtcggccat gccccaggcg 9780

tggtcctgac acctggcgag gtccttgtag tagtcctgca tgagccgctc cacgggcacc 9840

tcctcctcgc ccgcgcggcc gtgcatgcgc gtgagcccga acccgcgctg cggctggacg 9900

agcgccaggt cggcgacgac gcgctcggcg aggatggcct gctggatctg ggtgagggtg 9960

gtctggaagt cgtcgaagtc gacgaagcgg tggtaggctc cggtgttgat ggtgtaggag 10020

cagttggcca tgacggacca gttgacggtc tggtq/gccgg ggcgcacgag ctcgtggtac 10080

ttgaggcgcg agtaggcgcg cgtgtcgaag atgtagtcgt tgcaggtgcg cacgaggtac 10140

tggtatccga cgaggaagtg cggcggcggc tggcggtaga gcggccatcg ctcggtggcg 10200

g-g-ggcgCCgg gcgcgaggtc ctcgagcatg agcrccfdtcrat agccgtagat gtacctggac 10260

atccaggtga tgccggcggc accrccfcaaaa actcgcggac gcggttccag 10320

atgttgcgca gcggcaggaa gtagttcatg ataaccacaa tctggcccgt gaggcgcgcg 10380

cagtcgtgga tgctctagac atacgggcaa aaacgaaagc ggtcagcggc tcgactccgt 10440

ggcctggagg ctaagcgaac gggttgggct gcgcgtgtac cccggttcga gtccctgctc 10500

gaatcaggct ggagccgcag ctaacgtggt actggcactc ccgtctcgac ccaagcctgc 10560

taacgaaacc tccaggatac ggaggcgggt cgttttggcc attttcgtca ggccggaaat 10620

gaaactagta agcgcggaaa gcggccgtcc gcgatggctc gctgccgtag tctggagaaa 10680
gaatcgccag ggttgcgttg cggtgtgccc cggttcgagc ctcagcgctc ggcgccggcc 10740
ggattccgcg gctaacgtgg gcgtggctgc cccgtcgttt ccaagacccc ttagccagcc 10800
gacttctcca gttacggagc gagcccctct ttttcttgtg tttttgccag atgcatcccg 10860
tactgcggca gatgcgcccc caccctccac cacaaccgcc cctaccgcag cagcagcaac 10920
agccggcgct tctgcccccg ccccagcagc agcagccagc cactaccgcg gcggccgccg 10980
tgagcggagc cggcgttcag tatgacctgg ccttggaaga gggcgagggg ctggcgcggc 11040
tgggggcgtc gtcgccggag cggcacccgc gcgtgcagat gaaaagggac gctcgcgagg 11100
cctacgtgcc caagcagaac ctgttcagag acaggagcgg cgaggagccc gaggagatgc 11160
gcgcctcccg cttccacgcg gggcgggagc tgcggcgcgg cctggaccga aagcgggtgc 11220
tgagggacga ggatttcgag gcggacgagc tgacggggat cagccccgcg cgcgcgcacg 11280
tggccgcggc caacctggtc acggcgtacg agcagaccgt gaaggaggag agcaacttcc 11340
aaaaatcctt caacaaccac gtgcgcacgc tgatcgcgcg cgaggaggtg accctgggcc 11400
tgatgcacct gtgggacctg ctggaggcca tcgtgcagaa ccccacgagc aagccgctga 11460
cggcgcagct gtttctggtg gtgcagcaca gtcgggacaa cgagacgttc agggaggcgc 11520
tgctgaatat caccgagccc gagggccgct ggctcctgga cctggtgaac attctgcaga 11580
gcatcgtggt gcaggagcgc gggctgccgc tgtccgagaa gctggcggcc atcaacttct 11640
cggtgctgag cctgggcaag tactacgcta ggaagatcta caagaccccg tacgtgccca 11700
tagacaagga ggtgaagatc gacgggtttt acatgcgcat gaccctgaaa gtgctgaccc 11760
tgagcgacga tctgggggtg taccgcaacg acaggatgca ccgcgcggtg agcgccagcc 11820
gccggcgcga gctgagcgac caggagctga tgcacagcct gcagcgggcc ctgaccgggg 11880
ccgggaccga gggggagagc tactttgaca tgggcgcgga cctgcgctgg cagcctagcc 11940
gccgggcctt ggaagctgcc ggcggttccc cctacgtgga ggaggtggac gatgaggagg 12000
aggagggcga gtacctggaa gactgatggc gcgaccgtat ttttgctaga tgcagcaaca 12060
gccaccgccg cctcctgatc ccgcgatgcg ggcggcgctg cagagccagc cgtccggcat 12120
taactcctcg gacgattgga cccaggccat gcaacgcatc atggcgctga cgacccgcaa 12180
tcccgaagcc tttagacagc agcctcaggc caaccgactc tcggccatcc tggaggccgt 12240
ggtgccctcg cgctcgaacc ccacgcacga gaaggtgctg gccatcgtga acgcgctggt 12300
ggagaacaag gccatccgcg gcgacgaggc cgggctggtg tacaacgcgc tgctggagcg 12360
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13620 
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13740 
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gacttgggtg 


ggagtggtgg 
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ttcgctcacc 


tgcgcccccg 


13800 


tatcgggcgc ctgatgtaag aatctgaaaa aataaaagac ggtactcacc aaggccatgg 13860 
cgaccagcgt gcgttcttct ctgttgtttg tagtagt atg atg agg cgc gtg tac 13915 
Met Met Arg Arg Val Tyr 

ccg gag ggt cct cct ccc tcg tac gag agc gtg atg cag cag gcg gtg 13963 
Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Val 
10 15 20 

gcg gcg gcg atg cag ccc ccg ctg gag gcg cct tac gtg ccc ccg cgg 14011 
Ala Ala Ala Met Gln Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg 
25 30 35 

tac ctg gcg cct acg gag ggg cgg aac agc att cgt tac tcg gag ctg 14059 
Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu 
40 45 50 

gca ccc ttg tac gat acc acc cgg ttg tac ctg gtg gac aac aag tcg 14107 
Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser 
55 60 65 70 

gcg gac atc gcc tcg ctg aac tac cag aac gac cac agc aac ttc ctg 14155 
Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu 
75 80 85 

acc acc gtg gtg cag aac aac gat ttc acc ccc acg gag gcc agc acc 14203 
Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr 
90 95 100 

cag acc atc aac ttt gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa 14251 
Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys 
105 110 115 

acc atc atg cac acc aac atg ccc aac gtg aac gag ttc atg tac agc 14299 
Thr Ile Met His Thr Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser 
120 125 130 

aac aag ttc aag gcg cgg gtg atg gtc tcg cgc aag acc ccc aac ggg 14347 
Asn Lys Phe Lys Ala Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly 
135 140 145 150 

gtc aca gta aca gat ggt agt cag gac gag ctg acc tac gag tgg gtg 14395 
Val Thr Val Thr Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val 
155 160 165 

gag ttt gag ctg ccc gag ggc aac ttc tcg gtg acc atg acc atc gat 14443 
Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp 
170 175 180 

ctg atg aac aac gcc atc atc gac aac tac ttg gcg gtg ggg cgg cag 14491 
Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln 
185 190 195 

aac ggg gtg ctg gag agc gac atc ggc gtg aag ttc gac acg cgc aac 14539 
Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn 
200 205 210 

ttc cgg ctg ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg 14587 
Phe Arg Leu Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val 
215 220 225 230 

tac acc aac gag gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc 14635 
Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys 
235 240 245 

ggc gtg gac ttc acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc 14683 
Gly Val Asp Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg 
250 255 260 

aag cgg cag ccc ttc cag gag ggc ttc cag atc ctg tac gag gac ctg 14731 
Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu 
265 270 275 

gag ggg ggc aac atc ccc gcg ctg ctg gac gtg gac gcc tac gag aaa 14779 
Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Asp Ala Tyr Glu Lys 
280 285 290 

agc aag gag gat agc gcc gcc gcg gcg acc gca gcc gtg gcc acc gcc 14827 
Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala 
295 300 305 310 

tct acc gag gtg cgg ggc gat aat ttt gct agc gcc gcg aca ctg gca 14875 
Ser Thr Glu Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Thr Leu Ala 
315 320 325 

gcg gcc gag gcg gct gaa acc gaa agt aag ata gtg atc cag ccg gtg 14923 
Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val 
330 335 340 

gag aag gac agc aag gag agg agc tac aac gtg ctc gcg gac aag aaa 14971 
Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys 
345 350 355 

aac acc gcc tac cgc agc tgg tac ctg gcc tac aac tac ggc gac ccc 15019 
Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro 
360 365 370 

gag aag ggc gtg cgc tcc tgg acg ctg ctc acc acc tcg gac gtc acc 15067 
Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr 
375 380 385 390 

tgc ggc gtg gag caa gtc tac tgg tcg ctg ccc gac atg atg caa gac 15115 
Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp 
395 400 405 

ccg gtc acc ttc cgc tcc acg cgt caa gtt agc aac tac ccg gtg gtg 15163 
Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val 
410 415 420 

ggc gcc gag ctc ctg ccc gtc tac tcc aag agc ttc ttc aac gag cag 15211 
Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln 
425 430 435 

gcc gtc tac tcg cag cag ctg cgc gcc ttc acc tcg ctc acg cac gtc 15259 
Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val 
440 445 450 

ttc aac cgc ttc ccc gag aac cag atc ctc gtt cgc ccg ccc gcg ccc 15307 
Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro 
455 460 465 470 

acc att acc acc gtc agt gaa aac gtt cct gct ctc aca gat cac ggg 15355 
Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly 
475 480 485 

acc ctg ccg ctg cgc agc agt atc cgg gga gtc cag cgc gtg acc gtc 15403 
Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val 
490 495 500 

act gac gcc aga cgc cgc acc tgc ccc tac gtc tac aag gcc ctg ggc 15451 
Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly 
505 510 515 

gta gtc gcg ccg cgc gtc ctc tcg agc cgc acc ttc taa aaaatgtcca 15500 
Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr Phe 
520 525 530 






ttctcatctc gcccagtaat aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg 15560 
gaggcgctcg ccaacgctcc acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc 15620 


cct era gq cgc 


cctcaagggc 


cgcgtgcgct 


cgcgcaccac 


cgtcgacgac 


gtgatcgacc 


15680 


aaatarcftgac 



cgacgcgcgc 


aactacacgc 


ccgccgccgc 


gcccgtctcc 


accgtggacg 


15740 


ccgtcatcga cagcgtggtg gccgacgcgc gccggtacgc ccgcgccaag agccggcggc 15800 
ggcgcatcgc ccggcgacac cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc 15860 
gcagggccag gcgcacggga cgcagggcca tgctcagggc ggccagacgc gcggcctccg 15920 
gcagcagcag cgccggcagg acccgcagac gcgcggccac ggcggcggcg gcggccatcg 15980 
ccagcatgtc ccgcccgcgg cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg 16040 
tgcgcgtgcc cgtgcgcacc cgcccccctc gcacttgaag atgctgactt cgcgatgttg 16100 
atgtgtccca gcggcgagga ggatgtccaa gcgcaaattc aaggaagaga tgctccaggt 16160 
catcgcgcct gagatctacg gcccggcggc ggtgaaggag gaaagaaagc cccgcaaact 16220 
gaagcgggtc aaaaaggaca aaaaggagga ggaagatgtg gacggactgg tggagtttgt 16280 
gcgcgagttc gccccccggc ggcgcgtgca gtggcgcggg cggaaagtga aaccggtgct 16340 
gcgacccggc accacggtgg tcttcacgcc cggcgagcgt tccggctccg cctccaagcg 16400 
ctcctacgac gaggtgtacg gggacgagga catcctcgag caggcggccg aacgtctggg 16460 
cgagtttgct tacggcaagc gcagccgccc cgcgcccttg aaagaggagg cggtgtccat 16520 
cccgctggac cacggcaacc ccacgccgag cctgaagccg gtgaccctgc agcaggtgct 16580 
gcctggtgcg gcgccgcgcc ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac 16640 
catgcagctg atggtgccca agcgccagaa gctggaggac gtgctggagc acatgaaggt 16700 
ggaccccgag gtgcagcccg aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct 16760 
gggcgtgcag accgtggaca tcaagatccc cacggagccc atggaaacgc agaccgagcc 16820 
cgtgaagccc agcaccagca ccatggaggt gcagacggat ccctggatgc cggcaccggc 16880 
ttccaccacc cgccgaagac gcaagtacgg cgcggccagc ctgctgatgc ccaactacgc 16940 
gctgcatcct tccatcatcc ccacgccggg ctaccgcggc acgcgcttct accgcggcta 17000 
caccagcagc cgccgccgca agaccaccac ccgccgccgc cgtcgtcgca cccgccgcag 17060 
cagcaccgcg acttccgccg ccgccctggt gcggagagtg taccgcagcg ggcgcgagcc 17120 
tctgaccctg ccgcgcgcgc gctaccaccc gagcatcgcc atttaactac cgcctcctac 17180 
ttgcagatat ggccctcaca tgccgcctcc gcgtccccat tacgggctac cgaggaagaa 17240 
agccgcgccg tagaaggctg acggggaacg ggctgcgtcg ccatcaccac cggcggcggc 17300 
gcgccatcag caagcggttg gggggaggct tcctgcccgc gctgatgccc atcatcgccg 17360 
cggcgatcgg ggcgatcccc ggcatagctt ccgtggcggt gcaggcctct cagcgccact 17420 
gagacacagc ttggaaaatt tgtaataaaa aatggactga cgctcctggt cctgtgatgt 17480 
gtgtttttag atggaagaca tcaatttttc gtccctggca ccgcgacacg gcacgcggcc 17540 
gtttatgggc acctggagcg acatcggcaa cagccaactg aacgggggcg ccttcaattg 17600 
gagcagtctc tggagcgggc ttaagaattt cgggtccacg ctcaaaacct atggcaacaa 17660 
ggcgtggaac agcagcacag ggcaggcgct gagggaaaag ctgaaagagc agaacttcca 17720 
gcagaaggtg gtcgatggcc tggcctcggg catcaacggg gtggtggacc tggccaacca 17780 
ggccgtgcag aaacagatca acagccgcct ggacgcggtc ccgcccgcgg ggtccgtgga 17840 
gatgccccag gtggaggagg agctgcctcc cctggacaag cgcggcgaca agcgaccgcg 17900 
tcccgacgcg gaggagacgc tgctgacgca cacggacgag ccgcccccgt acgaggaggc 17960 
ggtgaaactg ggtctgccca ccacgcggcc cgtggcgcct ctggccaccg gggtgctgaa 18020 
acccagcagc agcagcagcc agcccgcgac cctggacttg cctccgcctg cttcccgccc 18080 
ctccacagtg gctaagcccc tgccgccggt ggccgtcgcg tcgcgcgccc cccgaggccg 18140 
cccccaggcg aactggcaga gcactctgaa cagcatcgtg ggtctgggag tgcagagtgt 18200 
gaagcgccgc cgctgctatt aaaagacact gtagcgctta acttgcttgt ctgtgtgtat 18260 
atgtatgtcc gccgaccaga aggaggagga agaggcgcgt cgccgagttg caag atg 18317 
Met 






WO 03/046124 



PCT/US02/33645 



gcc acc cca tcg atg ctg ccc cag tgg gcg tac atg cac atc gcc gga 18365 
Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala Gly 
535 540 545 

cag gac gct tcg gag tac ctg agt ccg ggt ctg gtg cag ttc gcc cgc 18413 
Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg 
550 555 560 

gcc aca gac acc tac ttc agt ctg ggg aac aag ttt agg aac ccc acg 18461 
Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr 
565 570 575 

gtg gcg ccc acg cac gat gtg acc acc gac cgc agc cag cgg ctg acg 18509 
Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr 
580 585 590 595 

ctg cgc ttc gtg ccc gtg gac cgc gag gac aac acc tac tcg tac aaa 18557 
Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr Lys 
600 605 610 

gtg cgc tac acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc 18605 
Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala 
615 620 625 

agc acc tac ttt gac atc cgc ggc gtg ctg gat cgg ggc cct agc ttc 18653 
Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser Phe 
630 635 640 

aaa ccc tac tcc ggc acc gct tac aac agc ctg gct ccc aag gga gcg 18701 
Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala 
645 650 655 

ccc aac act tgc cag tgg aca tat aaa gct gat ggt gat act ggt aca 18749 
Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly Thr 
660 665 670 675 

gaa aaa acc tat aca tat gga aat gcg cct gtg caa ggc att agt att 18797 
Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Ser Ile 
680 685 690 

aca aaa gat ggt att caa ctt gga act gac act gat gat cag ccc att 18845 
Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro Ile 
695 700 705 

tat gca gat aaa act tat caa cca gag cct caa gtg ggt gat gct gaa 18893 
Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala Glu 
710 715 720 

tgg cat gac atc act ggt act gat gaa aaa tat gga ggc aga gct ctc 18941 
Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu 
725 730 735 

aag cct gac acc aaa atg aag ccc tgc tat ggt tct ttt gcc aag cct 18989 
Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 
740 745 750 755 

acc aat aaa gaa gga ggt cag gca aat gtg aaa acc gaa aca ggc ggt 19037 
Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Glu Thr Gly Gly 
760 765 770 

acc aaa gaa tat gac att gac atg gca ttc ttc gat aat cga agt gca 19085 
Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala 
775 780 785 

gct gcg gct ggc ctg gcc cca gaa att gtt ttg tat act gag aat gtg 19133 
Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val 
790 795 800 

gat ctg gaa act cca gat act cat att gta tac aag gcg ggc aca gat 19181 
Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp 
805 810 815 

gac agc agc tct tct atc aat ttg ggt cag cag tcc atg ccc aac aga 19229 
Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn Arg 
820 825 830 835 

ccc aac tac att ggc ttt aga gac aac ttt atc ggg ctc atg tac tac 19277 
Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr 
840 845 850 

aac agc act ggc aac atg ggc gtg ctg gct ggt cag gcc tcc cag ctg 19325 
Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu 
855 860 865 

aat gct gtg gtg gac ttg cag gac aga aac act gaa ctg tcc tac cag 19373 
Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln 
870 875 880 

ctc ttg ctt gac tct ctg ggc gac aga acc agg tat ttc agt atg tgg 19421 
Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp 
885 890 895 

aat cag gcg gtg gac agc tat gac ccc gat gtg cgc att att gaa aat 19469 
Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn 
900 905 910 915 

cac ggt gtg gag gat gaa ctc cct aac tat tgc ttc ccc ctg gat gct 19517 
His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 
920 925 930 

gtg ggt aga act gat act tac cag gga att aag gcc aat ggt gct gat 19565 
Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Ala Asp 
935 940 945 

caa acc acc tgg acc aaa gat gat act gtt aat gat gct aat gaa ttg 19613 
Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu 
950 955 960 

ggc aag ggc aat cct ttc gcc atg gag atc aac atc cag gcc aac ctg 19661 
Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu 
965 970 975 

tgg cgg aac ttc ctc tac gcg aac gtg gcg ctg tac ctg ccc gac tcc 19709 
Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 
980 985 990 995 

tac aag tac acg ccg gcc aac atc acg ctg ccg acc aac acc aac 19754 
Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn 
1000 1005 1010 

acc tac gat tac atg aac ggc cgc gtg gtg gcg ccc tcg ctg gtg 19799 
Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val 
1015 1020 1025 

gac gcc tac atc aac atc ggg gcg cgc tgg tcg ctg gac ccc atg 19844 
Asp Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met 
1030 1035 1040 

gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg cgc 19889 
Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg 
1045 1050 1055 

tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc cac 19934 
Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His 
1060 1065 1070 

atc cag gtg ccc caa aag ttc ttc gcc atc aag agc ctc ctg ctc 19979 
Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu 
1075 1080 1085 

ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac gtc 20024 
Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val 
1090 1095 1100 

aac atg atc ctg cag agc tcc ctc ggc aac gac ctg cgc acg gac 20069 
Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr Asp 
1105 1110 1115 

ggg gcc tcc atc gcc ttc acc agc atc aac ctc tac gcc acc ttc 20114 
Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr Phe 
1120 1125 1130 

ttc ccc atg gcg cac aac acc gcc tcc acg ctc gag gcc atg ctg 20159 
Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu 
1135 1140 1145 

cgc aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg gcg 20204 
Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala 
1150 1155 1160 

gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc aac gtg ccc 20249 
Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro 
1165 1170 1175 

atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc gga tgg tcc 20294 
Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser 
1180 1185 1190 

ttc acg cgc ctc aag acc cgc gag acg ccc tcg ctc ggc tcc ggg 20339 
Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly 
1195 1200 1205 

ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac ctc gac 20384 
Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp 
1210 1215 1220 

ggc acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc acc 20429 
Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr 
1225 1230 1235 

ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgc ctc ctg acg 20474 
Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
1240 1245 1250 

ccc aac gag ttc gaa atc aag cgc acc gtc gac gga gag ggg tac 20519 
Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr 
1255 1260 1265 

aac gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc cag 20564 
Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln 
1270 1275 1280 

atg ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc 20609 
Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro 
1285 1290 1295 

gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc cag 20654 
Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln 
1300 1305 1310 

ccc atg agc cgc cag gtc gtg gac gag gtc aac tac aag gac tac 20699 
Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr 
1315 1320 1325 

cag gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc gtc 20744 
Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe Val 
1330 1335 1340 

ggc tac ctc gcg ccc acc atg cgc cag gga cag ccc tac ccc gcc 20789 
Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro Ala 
1345 1350 1355 

aac tac ccc tac ccg ctc atc ggc aag agc gcc gtc gcc agc gtc 20834 
Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser Val 
1360 1365 1370 

acc cag aaa aag ttc ctc tgc gac cgg gtc atg tgg cgc atc ccc 20879 
Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro 
1375 1380 1385 

ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc ggc 20924 
Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly 
1390 1395 1400 

cag aac atg ctc tac gcc aac tcc gcc cac gcg cta gac atg aat 20969 
Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn 
1405 1410 1415 

ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt gtc 21014 
Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val 
1420 1425 1430 

ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc ggc 21059 
Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg Gly 
1435 1440 1445 

gtc atc gag gcc gtc tac ctg cgc acg ccc ttc tcg gcc ggc aac 21104 
Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 
1450 1455 1460 

gcc acc acc taa gccccgctct tgcttcttgc aagatgacgg cctgtgcggg 21156 
Ala Thr Thr 



ctccggcgag caggagctca gggccatcct ccgcgacctg ggctgcgggc cctgcttcct 21216 
gggcaccttc gacaagcgct tcccgggatt catggccccg cacaagctgg cctgcgccat 21276 
cgtcaacacg gccggccgcg agaccggggg cgagcactgg ctggccttcg cctggaaccc 21336 
gcgctcccac acctgctacc tcttcgaccc cttcgggttc tcggacgagc gcctcaagca 21396 


<Tia "f - "f" /-"» f* /t 
yd LL LdL-Cdy 


T-.T_cgagr.acg 


agggcctgct 


gcgccgcagc 


gccctggcca 


ccgaggaccg 


21456 


ctgcgtcacc ctggaaaagt ccacccagac cgtgcagggt ccgcgctcgg ccgcctgcgg 21516 
gctcttctgc tgcatgttcc tgcacgcctt cgtgcactgg cccgaccgcc ccatggacaa 21576 
gaaccccacc atgaacttgc tgacgggggt gcccaacggc atgctccagt cgccccaggt 21636 
ggaacccacc ctgcgccgca accaggaggc gctctaccgc ttcctcaacg cccactccgc 21696 
ctactttcgc tcccaccgcg cgcgcatcga gaaggccacc gccttcgacc gcatgaatca 21756 
agacatgtaa accgtgtgtg tatgtgaatg ctttattcat aataaacagc acatgtttat 21816 
gccacctttt ctgaggctct gactttattt agaaatcgaa ggggttctgc cggctctcgg 21876 
cgtgccccgc gggcagggat acgttgcgga actggtactt gggcagccac ttgaactcgg 21936 
ggatcagcag cttcggcacg gggaggtcgg ggaacgagtc gctccacagc ttgcgcgtga 21996 
gttgcagggc gcccagcagg tcgggcgcgg agatcttgaa atcgcagttg ggacccgcgt 22056 
tctgcgcgcg ggagttgcgg tacacggggt tgcagcactg gaacaccatc agggccgggt 22116 
gcttcacgct cgccagcacc gtcgcgtcgg tgatgccctc cacgtccaga tcctcggcgt 22176 
tggccatccc gaagggggtc atcttgcagg tctgccgccc catgctgggc acgcagccgg 22236 


gcttgtggtt 


geaategcag 


tgcaggggga 


tcagcatcat 


ctgggcctgc 


tcggagctca 


22296 
tgcccgggta catggccttc atgaaagcct ccagctggcg gaaggcctgc tgcgccttgc 22356 
cgccctcggt gaagaagacc ccgcaggact tgctagagaa ctggttggtg gcgcagccgg 22416 
cgtcgtgcac gcagcagcgc gcgtcgttgt tggccagctg caccacgctg cgcccccagc 22476 
ggttctgggt gatcttggcc cggtcggggt tctccttcag cgcgcgctgc ccgttctcgc 22536 
tcgccacatc catctcgatc gtgtgctcct tctggatcat cacggtcccg tgcaggcatc 22596 
gcagcttgcc ctcggcctcg gtgcacccgt gcagccacag cgcgcagccg gtgcactccc 22656 
agttcttgtg ggcgatctgg gagtgcgagt gcacgaagcc ctgcaggaag cggcccatca 22716 
tcgtggtcag ggtcttgttg ctggtgaagg tcagcgggat gccgcggtgc tcctcgttca 22776 
catacaggtg gcagatgcgg cggtacacct cgccctgctc gggcatcagc tggaaggcgg 22836 
acttcaggtc gctctccacg cggtaccggt ccatcagcag cgtcatgact tccatgccct 22896 
tctcccaggc cgagacgatc ggcaggctca gggggttctt caccgccgtt gtcatcttag 22956 
tcgccgccgc tgaggtcagg gggtcgttct cgtccagggt ctcaaacact cgcttgccgt 23016 
ccttctcggt gatgcgcacg gggggaaagc tgaagcccac ggccgccagc tcctcctcgg 23076 
cctgcctttc gtcctcgctg tcctggctga tgtcttgcaa aggcacatgc ttggtcttgc 23136 
ggggtttctt tttgggcggc agaggcggcg gcggagacgt gctgggcgag cgcgagttct 23196 
cgctcaccac gactatttct tcttcttggc cgtcgtccga gaccacgcgg cggtaggcat 23256 
gcctcttctg gggcagaggc ggaggcgacg ggctctcgcg gttcggcggg cggctggcag 23316 
agccccttcc gcgttcgggg gtgcgctcct ggcggcgctg ctctgactga cttcctccgc 23376 
ggccggccat tgtgttctcc tagggagcaa caagcatgga gactcagcca tcgtcgccaa 23436 
catcgccatc tgcccccgcc gccgccgacg agaaccagca gcagaatgaa agcttaaccg 23496 
ccccgccgcc cagccccacc tccgacgccg ccgcggcccc agacatgcaa gagatggagg 23556 
aatccatcga gattgacctg ggctacgtga cgcccgcgga gcacgaggag gagctggcag 23616 
cgcgcttttc agccccggaa gagaaccacc aagagcagcc agagcaggaa gcagagagcg 23676 
agcagcagca ggctgggctc gagcatggcg actacctgag cggggcagag gacgtgctca 23736 
tcaagcatct ggcccgccaa tgcatcatcg tcaaggacgc gctgctcgac cgcgccgagg 23796 
tgcccctcag cgtggcggag ctcagccgcg cctacgagcg caacctcttc tcgccgcgcg 23856 
tgccccccaa gcgccagccc aacggcacct gcgagcccaa cccgcgcctc aacttctacc 23916 
cggtcttcgc ggtgcccgag gccctggcca cctaccacct ctttttcaag aaccaaagga 23976 
tccccgtctc ctgccgcgcc aaccgcaccc gcgccgacgc cctgctcaac ctgggtcccg 24036 
gcgcccgcct acctgatatc gcctccttgg aagaggttcc caagatcttc gagggtctgg 24096 
gcagcgacga gactcgggcc gcgaacgctc tgcaaggaag cggagaggag catgagcacc 24156 
acagcgccct ggtggagttg gaaggcgaca acgcgcgcct ggcggtgctc aagcgcacgg 24216 
tcgagctgac ccacttcgcc tacccggcgc tcaacctgcc ccccaaggtc atgagcgccg 24276 
tcatggacca ggtgctcatc aagcgcgcct cgcccctctc ggatgaggac atgcaggacc 24336 
ccgagagctc ggacgagggc aagcccgtgg tcagcgacga gcagctggcg cgctggctgg 24396 
gagcgagtag caccccccag agcttggaag agcggcgcaa gctcatgatg gccgtggtcc 24456 
tggtgaccgt ggagctggag tgtctgcgcc gcttcttcgc cgacgcagag accctgcgca 24516 
aggtcgagga gaacctgcac tacctcttca ggcacgggtt tgtgcgccag gcctgcaaga 24576 
tctccaacgt ggagctgacc aacctggtct cctacatggg catcctgcac gagaaccgcc 24636 
tggggcagaa cgtgctgcac accaccctgc gcggggaggc ccgccgcgac tacatccgcg 24696 
actgcgtcta cctgtacctc tgccacacct ggcagacggg catgggcgtg tggcagcagt 24756 
gcctggagga gcagaacctg aaagagctct gcaagctcct gcagaagaac ctgaaggccc 24816 
tgtggaccgg gttcgacgag cgcaccaccg cctcggacct ggccgacctc atcttccccg 24876 
agcgcctgcg gctgacgctg cgcaacggac tgcccgactt tatgagtcaa agcatgttgc 24936 
aaaactttcg ctctttcatc ctcgaacgct ccgggatcct gcccgccacc tgctccgcgc 24996 
tgccctcgga cttcgtgccg ctgaccttcc gcgagtgccc cccgccgctc tggagccact 25056 
gctacctgct gcgcctggcc aactacctgg cctaccactc ggacgtgatc gaggacgtca 25116 
gcggcgaggg tctgctcgag tgccactgcc gctgcaacct ctgcacgccg caccgctccc 25176 
tggcctgcaa cccccagctg ctgagcgaga cccagatcat cggcaccttc gagttgcaag 25236 
gccccggcga gggcaagggg ggtctgaaac tcaccccggg gctgtggacc tcggcctact 25296 
tgcgcaagtt cgtgcccgag gactaccatc ccttcgagat caggttctac gaggaccaat 25356 
cccagccgcc caaggccgaa ctgtcggcct gcgtcatcac ccagggggcc atcctggccc 25416 


cicLu. i— y < — ■ cj. utj ^ 


Uu L<ay add 


L.cccgccaag 


aatttctgct 


gaaaaagggc 


cacggggtct 


25476 


acctggaccc ccagaccgga gaggagctca accccagctt cccccaggat gccccgagga 25536 
agcagcaaga agctgaaagt ggagctgccg ccgccggagg atttggagga agactgggag 25596 
agcagtcagg cagaggagga ggagatggaa gactgggaca gcactcaggc agaggaggac 25656 
agcctgcaag acagtctgga agacgaggtg gaggaggagg cagaggaaga agcagccgcc 25716 
gccagaccgt cgtcctcggc ggagaaagca agcagcacgg ataccatctc cgctccgggt 25776 
cggggtcgcg gcgaccgggc ccacagtagg tgggacgaga ccgggcgctt cccgaacccc 25836 
accacccaga ccggtaagaa ggagcggcag ggatacaagt cctggcgggg gcacaaaaac 25896 
gccatcgtct cctgcttgca agcctgcggg ggcaacatct ccttcacccg ccgctacctg 25956 
ctcttccacc gcggggtgaa cttcccccgc aacatcttgc attactaccg tcacctccac 26016 
agcccctact actgtttcca agaagaggca gaaacccagc agcagcagaa aaccagcggc 26076 
agcagcagct agaaaatcca cagcggcggc aggtggactg aggatcgcag cgaacgagcc 26136 
ggcgcagacc cgggagctga ggaaccggat ctttcccacc ctctatgcca tcttccagca 26196 
gagtcggggg caggagcagg aactgaaagt caagaaccgt tctctgcgct cgctcacccg 26256 
cagttgtctg tatcacaaga gcgaagacca acttcagcgc actctcgagg acgccgaggc 26316 
tctcttcaac aagtactgcg cgctcactct taaagagtag cccgcgcccg cccacacacg 26376 
gaaaaaggcg ggaattacgt caccacctgc gcccttcgcc cgaccatcat catgagcaaa 26436 
gagattccca cgccttacat gtggagctac cagccccaga tgggcctggc cgccggcgcc 26496 
gcccaggact actccacccg catgaactgg ctcagcgccg ggcccgcgat gatctcacgg 26556 
gtgaatgaca tccgcgcccg ccgaaaccag atactcctag aacagtcagc gatcaccgcc 26616 
acgccccgcc atcaccttaa tccgcgtaat tggcccgccg ccctggtgta ccaggaaatt 26676 
ccccagccca cgaccgtact acttccgcga gacgcccagg ccgaagtcca gctgactaac 26736 
tcaggtgtcc agctggccgg cggcgccgcc ctgtgtcgtc accgccccgc tcagggtata 26796 
aagcggctgg tgatccgagg cagaggcaca cagctcaacg acgaggtggt gagctcttcg 26856 
ctgggtctgc gacctgacgg agtcttccaa ctcgccggat cggggagatc ttccttcacg 26916 
cctcgtcagg ccgtcctgac tttggagagt tcgtcctcgc agccccgctc gggtggcatc 26976 
ggcactctcc agttcgtgga ggagttcact ccctcggtct acttcaaccc cttctccggc 27036 
tcccccggcc actacccgga cgagttcatc ccgaacttcg acgccatcag cgagtcggtg 27096 
gacggctacg attgaatgtc ccatggtggc gcagctgacc tagctcggct tcgacacctg 27156 
gaccactgcc gccgcttccg ctgcttcgct cgggatctcg ccgagtttgc ctactttgag 27216 
ctgcccgagg agcaccctca gggcccggcc cacggagtgc ggatcatcgt cgaagggggc 27276 
ctcgactccc acctgcttcg gatcttcagc cagcgaccga tcctggtcga gcgcgagcaa 27336 
ggacagaccc ttctgaccct gtactgcatc tgcaaccacc ccggcctgca tgaaagtctt 27396 
tgttgtctgc tgtgtactga gtataataaa agctgagatc agcgactact ccggactcga 27456 
ttgtggtgtt cctgctatca accggtccct gttcttcacc gggaacgaga ccgagctcca 27516 
gcttcagtgt aagccccaca agaagtacct cacctggctg ttccagggct ccccgatcgc 27576 
cgttgtcaac cactgcgaca acgacggagt cctgctgagc ggccccgcca accttacttt 27636 
ttccacccgc agaagcaagc tccagctctt ccaacccttc ctccccggga cctatcagtg 27696 
cgtctcggga ccctgccatc acaccttcca cctgatcccg aataccacag cgccgctccc 27756 
cgctactaac aaccaaacta cccaccatcg ccaccgtcgc gacctttctg aatctaacac 27816 
taccacccac accggaggtg agctccgagg tcgaccaacc tctgggattt actacggccc 27876 
ctgggaggtg gtggggttaa tagcgctagg cctagttgtg ggtgggcttt tggctctctg 27936 
ctacctatac ctcccttgct gttcgtactt agtggtgctg tgttgctggt ttaagaaatg 27996 
gggaagatca ccctagtgag ctgcggtgcg ctggtggcgg tggtggtgtt ttcgattgtg 28056 
ggactgggcg gcgcggctgt agtgaaggag aaggccgatc cctgcttgca tttcaatccc 28116 
gacaattgcc agctgagttt tcagcccgat ggcaatcggt gcgcggtgct gatcaagtgc 28176 
ggatgggaat gcgagaacgt gagaatcgag tacaataaca agactcggaa caatactctc 28236 
gcgtccgtgt ggcagcccgg ggaccccgag tggtacaccg tctctgtccc cggtgctgac 28296 
ggctccccgc gcaccgtgaa caatactttc atttttgcgc acatgtgcga cacggtcatg 28356 
tggatgagca agcagtacga tatgtggccc cccacgaagg agaacatcgt ggtcttctcc 28416 
atcgcttaca gcgcgtgcac ggcgctaatc accgctatcg tgtgcctgag cattcacatg 28476 
ctcatcgcta ttcgccccag aaataatgcc gaaaaagaga aacagccata acacgttttt 28536 
tcacacacct ttttcagacc atggcctctg ttaaattttt gcttttattt gccagtctca 28596 
ttactgttat aagtaatgag aaactcacta tttacattgg cactaaccac actttagacg 28656 
gaattccaaa atcctcatgg tattgctatt ttgatcaaga tccagactta actatagaac 28716 
tgtgtggtaa caagggaaaa aatacaagca ttcatttaat taactttaat tgcggagaca 28776 
atttgaaatt aattaatatc actaaagagt atggaggtat gtattactat gttgcagaaa 28836 
ataacaacat gcagttttat gaagttactg taactaatcc caccacacct agaacaacaa 28896 
caaccaccac cacaaaaact acacctgtta ccactatgca gctcactacc aataacattt 28956 
ttgccatgcg tcaaatggtc aacaatagca ctcaacccac cccacccagt gaggaaattc 29016 
ccaaatccat gattggcatt attgttgctg tagtggtgtg catgttgatc atcgccttgt 29076 
gcatggtgta ctatgccttc tgctacagaa agcacagact gaacgacaag ctggaacact 29136 
tactaagtgt tgaattttaa ttttttagaa ccatgaagat cctaggcctt ttaatttttt 29196
ctatcattac ctctgctcta tgcaattctg acaatgagga cgttactgtc gttgtcggaa 29256
ccaattatac actgaaaggt ccagcgaagg gtatgctttc gtggtattgc tggtttggaa 29316
ctgacgagca acagacagag ctctgcaatg ctcaaaaagg caaaacctca aattctaaaa 29376
tctctaatta tcaatgcaat ggcactgact tagtactgct caatgtcacg aaagcatatg 29436
ctggcagcta cacctgccct ggagatgata ctgagaacat gattttttac aaagtggaag 29496
tggttgatcc cactactcca cctccaccca ccacaactac tcacaccaca cacacagaac 29556
aaaccacagc agaggaggca gcaaagttag ccttgcaggt ccaagacagt tcatttgttg 29616
gcattacccc tacacctgat cagcggtgtc cggggctgct cgtcagcggc attgtcggtg 29676
tgctttcggg attagcagtc ataatcatct gcatgttcat ttttgcttgc tgctatagaa 29736
ggctttaccg acaaaaatca gacccactgc tgaacctcta tgtttaattt tttccagagc 29796
catgaaggca gttagcactc tagttttttg ttctttgatt ggcactgttt ttagtgttag 29856
ctttttgaaa caaatcaatg ttactgaggg ggaaaatgtg acactggtag gcgtagaggg 29916
tgctcaaaat accacctgga caaaattcca tctagatggg tggaaagaaa tttgcacctg 29976
gaatgtcagt acttatacat gtgaaggagt taatcttacc attgtcaatg tcagccaaat 30036
tcaaaagggt tggattaaag ggcaatctgt tagtgttagc aatagtgggt actataccca 30096
gcatactctt atctatgaca ttatagttat accactgcct acacctagcc cacctagcac 30156
taccacacag acaacccaca ctacacaaac aaccacatac agtacatcaa atcagcctac 30216
caccactaca acagcagagg ttgccagctc gtctggggtc cgagtggcat ttttgatgtt 30276
ggccccatct agcagtccca ctgctagtac caatgagcag actactgaat ttttgtccac 30336
tgtcgagagc cacaccacag ctacctcgag tgccttctct agcaccgcca atctatcctc 30396
gctttcctct acaccaatca gtcccgctac tactcctacc cccgctattc tccccactcc 30456
cctgaagcaa acagacggcg acatgcaatg gcagatcacc ctgctcattg tgatcgggtt 30516
ggtcatcctg gccgtgttgc tctactacat cttctgccgc cgcattccca acgcgcaccg 30576
caagccggcc tacaagccca tcgttgtcgg gcagccggag ccgcttcagg tggaaggggg 30636
tctaaggaat cttctcttct cttttacagt atggtgattg aattatgatt cctagacaaa 30696
tcttgatcac tattcttatc tgcctcctcc aagtctgtgc caccctcgct ctggtggcca 30756
acgccagtcc agactgtatt gggcccttcg cctcctacgt gctctttgcc ttcatcacct 30816
gcatctgctg ctgtagcata gtctgcctgc ttatcacctt cttccagttc attgactgga 30876
tctttgtgcg catcgcctac ctgcgccacc acccccagta ccgcgaccag cgagtggcgc 30936
ggctgctcag gatcctctga taagcatgcg ggctctgcta cttctcgcgc ttctgctgtt 30996
agtgctcccc cgtcccgtcg acccccggac ccccacccag tcccccgagg aggtccgcaa 31056
atgcaaattc caagaaccct ggaaattcct caaatgctac cgccaaaaat cagacatgca 31116
tcccagctgg atcatgatca ttgggatcgt gaacattctg gcctgcaccc tcatctcctt 31176
tgtgatttac ccctgctttg actttggttg gaactcgcca gaggcgctct atctcccgcc 31236
tgaacctgac acaccaccac agcaacctca ggcacacgca ctaccaccac caccacagcc 31296
taggccacaa tacatgccca tattagacta tgaggccgag ccacagcgac ccatgctccc 31356
cgctattagt tacttcaatc taaccggcgg agatgactga cccactggcc aacaacaacg 31416
tcaacgacct tctcctggac atggacggcc gcgcctcgga gcagcgactc gcccaacttc 31476
gcattcgcca gcagcaggag agagccgtca aggagctgca ggacggcata gccatccacc 31536
agtgcaagaa aggcatcttc tgcctggtga aacaggccaa gatctcctac gaggtcaccc 31596
agaccgacca tcgcctctcc tacgagctcc tgcagcagcg ccagaagttc acctgcctgg 31656
tcggagtcaa ccccatcgtc atcacccagc agtcgggcga taccaagggg tgcatccact 31716
gctcctgcga ctcccccgac tgcgtccaca ctctgatcaa gaccctctgc ggcctccgcg 31776
acctcctccc catgaactaa tcaccccctt atccagtgaa ataaagatca tattgatgat 31836
ttgagtttaa taaaaataaa gaatcactta cttgaaatct gataccaggt ctctgtccat 31896
gttttctgcc aacaccactt cactcccctc ttcccagctc tggtactgca ggccccggcg 31956
ggctgcaaac ttcctccaca ccctgaaggg gatgtcaaat tcctcctgtc cctcaatctt 32016


cattttatct tctatcag atg tcc aaa aag cgc gtc cgg gtg gat gat gac 32067
Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp
1465 1470

ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32112
Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val
1475 1480 1485

ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga ttc caa gag 32157
Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu
1490 1495 1500

aag ccc ctg ggg gtg ctg tcc ctg cgt ctg gcc gat ccc gtc acc 32202
Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr
1505 1510 1515

acc aag aac ggg gaa atc acc ctc aag ctg gga gat ggg gtg gac 32247
Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Asp Gly Val Asp
1520 1525 1530

ctc gac tcc tcg gga aaa ctc atc tcc aac acg gcc acc aag gcc 32292
Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala
1535 1540 1545

gcc gcc cct ctc agt ttt tcc aac aac acc att tcc ctt aac atg 32337
Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met
1550 1555 1560

gat acc cct ttt tac aac aac aat gga aag tta ggc atg aaa gtc 32382
Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu Gly Met Lys Val
1565 1570 1575

act gct cca ctg aag ata cta gac aca gac ttg cta aaa aca ctt 32427
Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu Lys Thr Leu
1580 1585 1590

gtt gta gct tat gga caa ggt tta gga aca aac acc act ggt gcc 32472
Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Asn Thr Thr Gly Ala
1595 1600 1605

ctt gtt gcc caa cta gca tcc cca ctt gct ttt gat agc aat agc 32517
Leu Val Ala Gln Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn Ser
1610 1615 1620

aaa att gcc ctt aat tta ggc aat gga cca ttg aaa gtg gat gca 32562
Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala
1625 1630 1635

aat aga ctg aac atc aat tgc aat aga gga ctc tat gtt act acc 32607
Asn Arg Leu Asn Ile Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr
1640 1645 1650

aca aaa gat gca ctg gaa gcc aat ata agt tgg gct aat gct atg 32652
Thr Lys Asp Ala Leu Glu Ala Asn Ile Ser Trp Ala Asn Ala Met
1655 1660 1665

aca ttt ata gga aat gcc atg ggt gtc aat att gat aca caa aaa 32697
Thr Phe Ile Gly Asn Ala Met Gly Val Asn Ile Asp Thr Gln Lys
1670 1675 1680

ggc ttg caa ttt ggc acc act agt acc gtc gca gat gtt aaa aac 32742
Gly Leu Gln Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn
1685 1690 1695

gct tac ccc ata caa atc aaa ctt gga gct ggt ctc aca ttt gac 32787
Ala Tyr Pro Ile Gln Ile Lys Leu Gly Ala Gly Leu Thr Phe Asp
1700 1705 1710

agc aca ggt gca att gtt gca tgg aac aaa gat gat gac aag ctt 32832
Ser Thr Gly Ala Ile Val Ala Trp Asn Lys Asp Asp Asp Lys Leu
1715 1720 1725

aca cta tgg acc aca gcc gac ccc tct cca aat tgt cac ata tat 32877
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr
1730 1735 1740

tct gaa aag gat gct aag ctt aca ctt tgc ttg aca aag tgt ggc 32922
Ser Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly
1745 1750 1755

agt cag att ctg ggc act gtt tcc ctc ata gct gtt gat act ggc 32967
Ser Gln Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly
1760 1765 1770

agt tta aat ccc ata aca gga aca gta acc act gct ctt gtc tca 33012
Ser Leu Asn Pro Ile Thr Gly Thr Val Thr Thr Ala Leu Val Ser
1775 1780 1785

ctt aaa ttc gat gca aat gga gtt ttg caa agc agc tca aca cta 33057
Leu Lys Phe Asp Ala Asn Gly Val Leu Gln Ser Ser Ser Thr Leu
1790 1795 1800

gac tca gac tat tgg aat ttc aga cag gga gat gtt aca cct gct 33102
Asp Ser Asp Tyr Trp Asn Phe Arg Gln Gly Asp Val Thr Pro Ala
1805 1810 1815

gaa gcc tat act aat gct ata ggt ttc atg ccc aat cta aaa gca 33147
Glu Ala Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Leu Lys Ala
1820 1825 1830

tac cct aaa aac aca agt gga gct gca aaa agt cac att gtt ggg 33192
Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His Ile Val Gly
1835 1840 1845

aaa gtg tac cta cat ggg gat aca ggc aaa cca ctg gac ctc att 33237
Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu Ile
1850 1855 1860

att act ttc aat gaa aca agt gat gaa tct tgc act tac tgt att 33282
Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile
1865 1870 1875

aac ttt caa tgg cag tgg ggg gct gat caa tat aaa aat gaa aca 33327
Asn Phe Gln Trp Gln Trp Gly Ala Asp Gln Tyr Lys Asn Glu Thr
1880 1885 1890

ctt gcc gtc agt tca ttc acc ttt tcc tat att gct aaa gaa taa 33372
Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
1895 1900 1905

accccactct gtaccccatc tctgtctatg gaaaaaactc tgaaacacaa aataaaataa 33432 
agttcaagtg ttttattgat tcaacagttt tacaggattc gagcagttat ttttcctcca 33492 
ccctcccagg acatggaata caccaccctc tccccccgca cagccttgaa catctgaatg 33552
ccattggtga tggacatgct tttggtctcc acgttccaca cagtttcaga gcgagccagt 33612
ctcgggtcgg tcagggagat gaaaccctcc gggcactccc gcatctgcac ctcacagctc 33672
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aacagctgag gattgtcctc ggtggtcggg atcacggtta tctggaagaa gcagaagagc 33732 

ggcggtggga atcatagtcc gcgaacggga tcggccggtg gtgtcgcatc aggccccgca 33792 

gcagtcgctg tcgccgccgc tccgtcaagc tgctgctcag ggggtccggg tccagggact 33852 

ccctcagcat gatgcccacg gccctcagca tcagtcgtct ggtgcggcgg gcgcagcagc 33912 

gcatgcggat ctcgctcagg tcgctgcagt acgtgcaaca caggaccacc aggttgttca 33972 

acagtccata gttcaacacg ctccagccga aactcatcgc gggaaggatg ctacccacgt 34032 

ggccgtcgta ccagatcctc aggtaaatca agtggcgccc cctccagaac acgctgccca 34092 

tgtacatgat ctccttgggc atgtggcggt tcaccacctc ccggtaccac atcaccctct 34152 

ggttgaacat gcagccccgg atgatcctgc ggaaccacag ggccagcacc gccccgcccg 34212 

ccatgcagcg aagagacccc gggtcccgac aatggcaatg gaggacccac cgctcgtacc 34272 

cgtggatcat ctgggagctg aacaagtcta tgttggcaca gcacaggcat atgctcatgc 34332 

atctcttcag cactctcagc tcctcggggg tcaaaaccat atcccagggc acggggaact 34392 

cttgcaggac agcgaacccc gcagaacagg gcaatcctcg cacataactt acattgtgca 34452 

tggacagggt atcgcaatca ggcagcaccg ggtgatcctc caccagagaa gcgcgggtct 34512 

cggtctcctc acagcgtggt aagggggccg gccgatacgg gtgatggcgg gacgcggctg 34572 

atcgtgttcg cgaccgtgtt atgatgcagt tgctttcgga cattttcgta cttgctgtag 34632 

cagaacctgg tccgggcgct gcacaccgat cgccggcggc ggtcccggcg cttggaacgc 34692 

tcggtgttga agttgtaaaa cagccactct ctcagaccgt gcagcagatc tagggcctca 34752 

ggagtgatga agatcccatc atgcctgatg gctctaatca catcgaccac cgtggaatgg 34812 

gccagaccca gccagatgat gcaattttgt tgggtttcgg tgacggcggg ggagggaaga 34872 

acaggaagaa ccatgattaa cttttaatcc aaacggtctc ggagcacttc aaaatgaaga 34932 

tcgcggagat ggcacctctc gcccccgctg tgttggtgga aaataacagc caggtcaaag 34992 

gtgatacggt tctcgagatg ttccacggtg gcttccagca aagcctccac gcgcacatcc 35052 

agaaacaaga caatagcgaa agcgggaggg ttctctaatt cctcaatcat catgttacac 35112 

tcctgcacca tccccagata attttcattt ttccagcctt gaatgattcg aactagttcc 35172 

tgaggtaaat ccaagccagc catgataaag agctcgcgca gagcgccctc caccggcatt 35232 

cttaagcaca ccctcataat tccaagatat tctgctcctg gttcacctgc agcagattga 35292 

caagcggaat atcaaaatct ctgccgcgat ccctaagctc ctccctcagc aataactgta 35352 
agtactcttt catatcctct ccgaaatttt tagccatagg accaccagga ataagattag 35412
ggcaagccac agtacagata aaccgaagtc ctccccagtg agcattgcca aatgcaagac 35472
tgctataagc atgctggcta gacccggtga tatcttccag ataactggac agaaaatcgc 35532
ccaggcaatt tttaagaaaa tcaacaaaag aaaaatcctc caggtgcacg tttagagcct 35592
cgggaacaac gatggagtaa atgcaagcgg tgcgttccag catggttagt tagctgatct 35652
gtagaaaaaa acaaaaatga acattaaacc atgctagcct ggcgaacagg tgggtaaatc 35712
gttctctcca gcaccaggca ggccacgggg tctccggcac gaccctcgta aaaattgtcg 35772
ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat gattcgacaa 35832
gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc aaggaagcaa 35892
taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg atgaagcaca 35952
aaattctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag caaagccccc 36012
gatccctcca ggtacacata caaagcctca gcgtccatag cttaccgagc agcagcacac 36072
aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct ctctgctcaa 36132
tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc gccaaataat 36192


f-i -a <—* y~< ra r~* a c* r+ 
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ccagaaaccg 


gtgacacact 


caaaaaaata 


cgcgcacttc


q £Z o c; o 


ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatca aaattcgact 36312
ttcaaattcc gtcgaccgtt aaaaacgtcg cccgccccgc ccctaacggt cgccgctccc 36372
gcagccaatc accgccccgc atccccaaat tcaaatacct catttgcata ttaacgcgca 36432
ccaaaagttt gaggtatatt attgatgatg 36462



<210> 2 

<211> 530 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 

<400> 2 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser
1 5 10 15

Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45

Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60

Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80

Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95

Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110

Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140

Arg Lys Thr Pro Asn Gly Val Thr Val Thr Asp Gly Ser Gln Asp Glu
145 150 155 160

Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe Ser
165 170 175

Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn Tyr
180 185 190

Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly Val
195 200 205

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Glu
210 215 220

Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp Ile
225 230 235 240

Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu Ser
245 250 255

Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe Gln
260 265 270

Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp
275 280 285

Val Asp Ala Tyr Glu Lys Ser Lys Glu Asp Ser Ala Ala Ala Ala Thr
290 295 300

Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe Ala
305 310 315 320

Ser Ala Ala Thr Leu Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys
325 330 335

Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Glu Arg Ser Tyr Asn
340 345 350

Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala
355 360 365

Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu
370 375 380

Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu
385 390 395 400

Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val
405 410 415

Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys
420 425 430

Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe
435 440 445

Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu
450 455 460

Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro
465 470 475 480

Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly
485 490 495

Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr
500 505 510

Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg
515 520 525

Thr Phe
530



<210> 3 

<211> 933 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 

<400> 3 

Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125

Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp Gly Asp Thr Gly
130 135 140

Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Ser
145 150 155 160

Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr Asp Asp Gln Pro
165 170 175

Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala
180 185 190

Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala
195 200 205

Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys
210 215 220

Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Glu Thr Gly
225 230 235 240

Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser
245 250 255

Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn
260 265 270

Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr
275 280 285

Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn
290 295 300

Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr
305 310 315 320

Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln
325 330 335

Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr
340 345 350

Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met
355 360 365

Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu
370 375 380

Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp
385 390 395 400

Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Ala
405 410 415

Asp Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu
420 425 430

Leu Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn
435 440 445

Leu Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp
450 455 460

Ser Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn
465 470 475 480

Thr Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp
485 490 495

Ala Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn
500 505 510

Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser
515 520 525

Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro
530 535 540

Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr
545 550 555 560

Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser
565 570 575

Ser Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr
580 585 590

Ser Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala
595 600 605

Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe
610 615 620

Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn
625 630 635 640

Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe
645 650 655

Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu
660 665 670

Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr
675 680 685

Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile
690 695 700

Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr
705 710 715 720

Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn
725 730 735

Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu
740 745 750

Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr
755 760 765

Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg
770 775 780

Gln Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu
785 790 795 800

Ala Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr
805 810 815

Met Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile
820 825 830

Gly Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp
835 840 845

Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly
850 855 860

Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His
865 870 875 880

Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu
885 890 895

Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro
900 905 910

His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala
915 920 925

Gly Asn Ala Thr Thr
930



<210> 4 

<211> 445 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan5 
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<400> 4 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30

Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60

Lys Leu Gly Asp Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr
85 90 95

Ile Ser Leu Asn Met Asp Thr Pro Phe Tyr Asn Asn Asn Gly Lys Leu
100 105 110

Gly Met Lys Val Thr Ala Pro Leu Lys Ile Leu Asp Thr Asp Leu Leu
115 120 125

Lys Thr Leu Val Val Ala Tyr Gly Gln Gly Leu Gly Thr Asn Thr Thr
130 135 140

Gly Ala Leu Val Ala Gln Leu Ala Ser Pro Leu Ala Phe Asp Ser Asn
145 150 155 160

Ser Lys Ile Ala Leu Asn Leu Gly Asn Gly Pro Leu Lys Val Asp Ala
165 170 175

Asn Arg Leu Asn Ile Asn Cys Asn Arg Gly Leu Tyr Val Thr Thr Thr
180 185 190

Lys Asp Ala Leu Glu Ala Asn Ile Ser Trp Ala Asn Ala Met Thr Phe
195 200 205

Ile Gly Asn Ala Met Gly Val Asn Ile Asp Thr Gln Lys Gly Leu Gln
210 215 220

Phe Gly Thr Thr Ser Thr Val Ala Asp Val Lys Asn Ala Tyr Pro Ile
225 230 235 240

Gln Ile Lys Leu Gly Ala Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile
245 250 255

Val Ala Trp Asn Lys Asp Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala
260 265 270

Asp Pro Ser Pro Asn Cys His Ile Tyr Ser Glu Lys Asp Ala Lys Leu
275 280 285

Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Ser
290 295 300

Leu Ile Ala Val Asp Thr Gly Ser Leu Asn Pro Ile Thr Gly Thr Val
305 310 315 320

Thr Thr Ala Leu Val Ser Leu Lys Phe Asp Ala Asn Gly Val Leu Gln
325 330 335

Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp Asn Phe Arg Gln Gly Asp
340 345 350

Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn
355 360 365

Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly Ala Ala Lys Ser His Ile
370 375 380

Val Gly Lys Val Tyr Leu His Gly Asp Thr Gly Lys Pro Leu Asp Leu
385 390 395 400

Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu Ser Cys Thr Tyr Cys Ile
405 410 415

Asn Phe Gln Trp Gln Trp Gly Ala Asp Gln Tyr Lys Asn Glu Thr Leu
420 425 430

Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile Ala Lys Glu
435 440 445



<210> 5 

<211> 36604 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan6 
<220> 

<221> CDS 

<222> (13878)..(15467)

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18284)..(21112)

<223> L3 Hexon 

<220> 

<221> CDS 

<222> (32162)..(33493)

<223> L5 Fiber 

<400> 5 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 
atttggggag ggaggaaggt gattggctgc gggagcggcg accgttaggg gcggggcggg 120
tgacgttttg atgacgtggc tatgaggcgg agccggtttg caagttctcg tgggaaaagt 180
gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240
ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300
aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360
ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420
ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480
tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540
cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600
gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660
gtgacgaccc tccagagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720
atctggaggt ggatgtgccc gagagcgacc ctaacgagga ggcggtgaat gatttgttta 780
gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840
cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900
aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960
aggaggcgat tcgagctgcg gtgaaccagg gagtgaaaac tgcgggcgag agctttagcc 1020
tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080
ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140
acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200
atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260
tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320
agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380


ctacagggtg 


gggatgaacc 


tttggacttg 


tgtacccgga 
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1440 
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gtgtttactt 


aaggtgatgt 


cagtatttat 


agggtgtgga 


gtgcaataaa 
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atccgtgttg 


actttaagtg 


cgtgttttat 


gactcagggg 


tggggactgt 


gggtatataa 
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tggagatctg 
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gaagactttc 
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acagttgcta 


gagaactcat 


cggagggagt 


ctcttacctg 
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ttgacttttc 


tactcctggc 


1860 
agaactaccg ccgcggtagc cttttttgcc tttattcttg acaaatggag tcaagaaacc 1920
catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980
tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga 2040
atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100
cagcaagagg aggaccgaga agagaacccg agagccggtc tggaccctcc ggtggcggag 2160
gaggaggagt agctgacttg tttcccgagc tgcgccgggt gctgactagg tcttccagtg 2220
gacgggagag ggggattaag cgggagaggc atgaggagac tagccacaga actgaactga 2280
ctgtcagtct gatgagccgc aggcgcccag aatcggtgtg gtggcatgag gtgcagtcgc 2340
aggggataga tgaggtctcg gtgatgcatg agaaatattc cctagaacaa gtcaagactt 2400
gttggttgga gcccgaggat gattgggagg tagccatcag gaattatgcc aagctggctc 2460
tgaagccaga caagaagtac aagattacca aactgattaa tatcagaaat tcctgctaca 2520
tttcagggaa tggggccgag gtggagatca gtacccagga gagggtggcc ttcagatgtt 2580
gtatgatgaa tatgtacccg ggggtggtgg gcatggaggg agtcaccttt atgaacacga 2640
ggttcagggg tgatgggtat aatggggtgg tctttatggc caacaccaag ctgacagtgc 2700
acggatgctc cttctttggc ttcaataaca tgtgcatcga ggcctggggc agtgtttcag 2760
tgaggggatg cagcttttca gccaactgga tgggggtcgt gggcagaacc aagagcaagg 2820
tgtcagtgaa gaaatgcctg ttcgagaggt gccacctggg ggtgatgagc gagggcgaag 2880
ccaaagtcaa acactgcgcc tctaccgaga cgggctgctt tgtgctgatc aagggcaatg 2940
cccaagtcaa gcataacatg atctgtgggg cctcggatga gcgcggctac cagatgctga 3000
cctgcgccgg tgggaacagc catatgctgg ccaccgtgca tgtggcctcg cacccccgca 3060
agacatggcc cgagttcgag cacaacgtca tgacccgctg caatgtgcac ctgggctccc 3120
gccgaggcat gttcatgccc taccagtgca acatgcaatt tgtgaaggtg ctgctggagc 3180
ccgatgccat gtccagagtg agcctgacgg gggtgtttga catgaatgtg gagctgtgga 3240
aaattctgag atatgatgaa tccaagacca ggtgccgggc ctgcgaatgc ggaggcaagc 3300
acgccaggct tcagcccgtg tgtgtggagg tgacggagga cctgcgaccc gatcatttgg 3360
tgttgtcctg caacgggacg gagttcggct ccagcgggga agaatctgac tagagtgagt 3420
agtgtttggg gctgggtgtg agcctgcatg aggggcagaa tgactaaaat ctgtggtttt 3480
ctgtgtgttg cagcagcatg agcggaagcg cctcctttga gggaggggta ttcagccctt 3540
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atgggcgtcc 


agcgaggcca 


gggtccggtc 
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gctcatccgg 


ctggtcgaga 


accgctcccg 


gtcggcgccc 


tgcgcgtcgg 


ccaggtagca 


5340 


attgagcatg 


agttcgtagt 


tgagcgcctc 
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cccttggcgc 


ggagcttacc 
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ttggatgatg tcatacttga gctggccctt ctgcttccac agctcgcggt tgagaaggaa 7020 
ctcttcgcgg tccttccagt actcttcgag ggggaacccg tcctgatcgg cacggtaaga 7080
gcccaccatg tagaactggt tgacggcctt gtaggcgcag cagcccttct ccacggggag 7140 
ggcgtaagct tgcgcggcct tgcgcaggga ggtgtgggtg agggcgaagg tgtcgcgcac 7200
catgaccttg aggaactggt gcttgaagtc gaggtcgtcg cagccgccct gctcccagag 7260 
ttggaagtcc gtgcgcttct tgtaggcggg gttaggcaaa gcgaaagtaa catcgttgaa 7320 

gaggatcttg cccgcgcggg gcatgaagtt gcgagtgatg cggaaaggct ggggcacctc 7380 

ggcccggttg ttgatgacct gggcggcgag gacgatctcg tcgaagccgt tgatgttgtg 7440 

cccgacgatg tagagttcca cgaatcgcgg gcggcccttg acgtggggca gcttcttgag 7500 

ctcgtcgtag gtgagctcgg cggggtcgct gagcccgtgc tgctcgaggg cccagtcggc 7560 

gacgtggggg ttggcgctga ggaaggaagt ccagagatcc acggccaggg cggtctgcaa 7620 

gcggtcccgg tactgacgga actgttggcc cacggccatt ttttcggggg tgacgcagta 7680 

gaaggtgcgg gggtcgccgt gccagcggtc ccacttgagc tggagggcga ggtcgtgggc 7740

gagctcgacg agcggcgggt ccccggagag tttcatgacc agcatgaagg ggacgagctg 7800 

cttgccgaag gaccccatcc aggtgtaggt ttccacatcg taggtgagga agagcctttc 7860 

ggtgcgagga tgcgagccga tggggaagaa ctggatctcc tgccaccagt tggaggaatg 7920

gctgttgatg tgatggaagt agaaatgccg acggcgcgcc gagcactcgt gcttgtgttt 7980 

atacaagcgt ccgcagtgct cgcaacgctg cacgggatgc acgtgctgca cgagctgtac 8040 

ctgggttcct ttggcgagga atttcagtgg gcagtggagc gctggcggct gcatctcgtg 8100 

ctgtactacg tcttggccat cggcgtggcc atcgtctgcc tcgatggtgg tcatgctgac 8160 

gagcccgcgc gggaggcagg tccagacctc ggctcggacg ggtcggagag cgaggacgag 8220 

ggcgcgcagg ccggagctgt ccagggtcct gagacgctgc ggagtcaggt cagtgggcag 8280 

cggcggcgcg cggttgactt gcaggagctt ttccagggcg cgcgggaggt ccagatggta 8340 

cttgatctcc acggcgccgt tggtggctac gtccacggct tgcagggtgc cgtgcccctg 8400 

gggcgccacc accgtgcccc gtttcttctt gggcgctgct tccatgtcgg tcagaagcgg 8460 

cggcgaggac gcgcgccggg cggcaggggc ggctcggggc ccggaggcag gggcggcagg 8520 

ggcacgtcgg cgccgcgcgc gggcaggttc tggtactgcg cccggagaag actggcgtga 8580 

gcgacgacgc gacggttgac gtcctggatc tgacgcctct gggtgaaggc cacgggaccc 8640

gtgagtttga acctgaaaga gagttcgaca gaatcaatct cggtatcgtt gacggcggcc 8700 
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tgccgcagga tctcttgcac gtcgcccgag ttgtcctggt aggcgatctc ggtcatgaac 8760 

tgctcgatct cctcctcctg aaggtctccg cggccggcgc gctcgacggt ggccgcgagg 8820 

tcgttggaga tgcggcccat gagctgcgag aaggcgttca tgccggcctc gttccagacg 8880 

cggctgtaga ccacggctcc gtcggggtcg cgcgcgcgca tgaccacctg ggcgaggttg 8940 

agctcgacgt ggcgcgtgaa gaccgcgtag ttgcagaggc gctggtagag gtagttgagc 9000 

gtggtggcga tgtgctcggt gacgaagaag tacatgatcc agcggcggag cggcatctcg 9060 

ctgacgtcgc ccagggcttc caagcgttcc atggcctcgt agaagtccac ggcgaagttg 9120 

aaaaactggg agttgcgcgc cgagacggtc aactcctcct ccagaagacg gatgagctcg 9180 

gcgatggtgg cgcgcacctc gcgctcgaag gccccggggg gctcctcttc catctcctcc 9240 

tcttcctcct ccactaacat ctcttctact tcctcctcag gaggcggtgg cgggggaggg 9300 

gccctgcgtc gccggcggcg cacgggcaga cggtcgatga agcgctcgat ggtctccccg 9360 

cgccggcgac gcatggtctc ggtgacggcg cgcccgtcct cgcggggccg cagcatgaag 9420 

acgccgccgc gcatctccag gtggccgccg ggggggtctc cgttgggcag ggagagggcg 9480

ctgacgatgc atcttatcaa ttgacccgta gggactccgc gcaaggacct gagcgtctcg 9540 

agatccacgg gatccgaaaa ccgctgaacg aaggcttcga gccagtcgca gtcgcaaggt 9600 

aggctgagcc cggtttcttg ttcttcgggt atttggtcgg gaggcgggcg ggcgatgctg 9660 

ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 

tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 

cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 

cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct gcggctggac gagcgccagg 9900 

tcggcgacga cgcgctcggt gaggatggcc tgctggatct gggtgagggt ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtgtagga gcagttggcc 10020 

atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 

gagtaggcgc gcgtgtcgaa gatgtagtcg ttgcaggcgc gcacgaggta ctggtatccg 10140

acgaggaagt gcggcggcgg ctggcggtag agcggccatc gctcggtggc gggggcgccg 10200 

ggcgcgaggt cctcgagcat gaggcggtgg tagccgtaga tgtacctgga catccaggtg 10260 

atgccggcgg cggtggtgga ggcgcgcggg aactcgcgga cgcggttcca gatgttgcgc 10320 

agcggcagga agtagttcat ggtggccgcg gtctggcccg tgaggcgcgc gcagtcgtgg 10380

atgctctaga catacgggca aaaacgaaag cggtcagcgg ctcgactccg tggcctggag 10440
gctaagcgaa cgggttgggc tgcgcgtgta ccccggttcg aatctcgaat caggctggag 10500 
ccgcagctaa cgtggtactg gcactcccgt ctcgacccaa gcctgctaac gaaacctcca 10560 
ggatacggag gcgggtcgtt ttttggcctt ggtcgctggt catgaaaaac tagtaagcgc 10620 
ggaaagcggc cgcccgcgat ggctcgctgc cgtagtctgg agaaagaatc gccagggttg 10680 
cgttgcggtg tgccccggtt cgagcctcag cgctcggcgc cggccggatt ccgcggctaa 10740 
cgtgggcgtg gctgccccgt cgtttccaag accccttagc cagccgactt ctccagttac 10800 
ggagcgagcc cctctttttt tttcttgtgt ttttgccaga tgcatcccgt actgcggcag 10860 
atgcgccccc accctccacc acaaccgccc ctaccgcagc agcagcaaca gccggcgctt 10920 
ctgcccccgc cccagcagca gccagccact accgcggcgg ccgccgtgag cggagccggc 10980 
gttcagtatg acctggcctt ggaagagggc gaggggctgg cgcggctggg ggcgtcgtcg 11040 
ccggagcggc acccgcgcgt gcagatgaaa agggacgctc gcgaggccta cgtgcccaag 11100 
cagaacctgt tcagagacag gagcggcgag gagcccgagg agatgcgcgc ctcccgcttc 11160 
cacgcggggc gggagctgcg gcgcggcctg gaccgaaagc gggtgctgag ggacgaggat 11220 
ttcgaggcgg acgagctgac ggggatcagc cccgcgcgcg cgcacgtggc cgcggccaac 11280 
ctggtcacgg cgtacgagca gaccgtgaag gaggagagca acttccaaaa atccttcaac 11340 
aaccacgtgc gcacgctgat cgcgcgcgag gaggtgaccc tgggcctgat gcacctgtgg 11400 
gacctgctgg aggccatcgt gcagaacccc acgagcaagc cgctgacggc gcagctgttt 11460 
ctggtggtgc agcacagtcg ggacaacgag acgttcaggg aggcgctgct gaatatcacc 11520 
gagcccgagg gccgctggct cctggacctg gtgaacattt tgcagagcat cgtggtgcag 11580 
gagcgcgggc tgccgctgtc cgagaagctg gcggccatca acttctcggt gctgagtctg 11640 
ggcaagtact acgctaggaa gatctacaag accccgtacg tgcccataga caaggaggtg 11700 
aagatcgacg ggttttacat gcgcatgacc ctgaaagtgc tgaccctgag cgacgatctg 11760 
ggggtgtacc gcaacgacag gatgcaccgc gcggtgagcg ccagccgccg gcgcgagctg 11820 
agcgaccagg agctgatgca cagcctgcag cgggccctga ccggggccgg gaccgagggg 11880 
gagagctact ttgacatggg cgcggacctg cgctggcagc ccagccgccg ggccttggaa 11940 
gctgccggcg gttcccccta cgtggaggag gtggacgatg aggaggagga gggcgagtac 12000 
ctggaagact gatggcgcga ccgtattttt gctagatgca gcaacagcca ccgccgccgc 12060 
ctcctgatcc cgcgatgcgg gcggcgctgc agagccagcc gtccggcatt aactcctcgg 12120
aatctgaaaa aataaaagac ggtactcacc aaggccatgg cgaccagcgt gcgttcttct 13860

ctgttgtttg tagtagt atg atg agg cgc gtg tac ccg gag ggt cct cct 13910
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro
1 5 10

ccc tcg tac gag agc gtg atg cag cag gcg gtg gcg gcg gcg atg cag 13958
Pro Ser Tyr Glu Ser Val Met Gln Gln Ala Val Ala Ala Ala Met Gln
15 20 25

ccc ccg ctg gag gcg cct tac gtg ccc ccg cgg tac ctg gcg cct acg 14006
Pro Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr
30 35 40

gag ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat 14054
Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp
45 50 55

acc acc cgg ttg tac ctg gtg gac aac aag tcg gca gac atc gcc tcg 14102
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser
60 65 70 75

ctg aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag 14150
Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln
80 85 90

aac aac gat ttc acc ccc acg gag gcc agc acc cag acc atc aac ttt 14198
Asn Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe
95 100 105

gac gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg cac acc 14246
Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr
110 115 120

aac atg ccc aac gtg aac gag ttc atg tac agc aac aag ttc aag gcg 14294
Asn Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala
125 130 135

cgg gtg atg gtc tcg cgc aag acc ccc aac ggg gtg gat gat gat tat 14342
Arg Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr
140 145 150 155

gat ggt agt cag gac gag ctg acc tac gag tgg gtg gag ttt gag ctg 14390
Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu
160 165 170

ccc gag ggc aac ttc tcg gtg acc atg acc atc gat ctg atg aac aac 14438
Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn
175 180 185

gcc atc atc gac aac tac ttg gcg gtg ggg cgg cag aac ggg gtg ctg 14486
Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu
190 195 200

gag agc gac atc ggc gtg aag ttc gac acg cgc aac ttc cgg ctg ggc 14534
Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly
205 210 215

tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac gag 14582
Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu
220 225 230 235

gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc ggc gtg gac ttc 14630
Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe
240 245 250

acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc aag cgg cag ccc 14678
Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro
255 260 265

ttc cag gag ggc ttc cag atc ctg tac gag gac ctg gag ggg ggc aac 14726
Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn
270 275 280

atc ccc gcg ctc ttg gat gtc gaa gcc tac gag aaa agc aag gag gat 14774
Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp
285 290 295

agc acc gcc gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag gtg 14822
Ser Thr Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val
300 305 310 315

cgg ggc gat aat ttt gct agc gct gcg gca gcg gcc gag gcg gct gaa 14870
Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu
320 325 330

acc gaa agt aag ata gtc atc cag ccg gtg gag aag gac agc aag gac 14918
Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp
335 340 345

agg agc tac aac gtg ctc gcg gac aag aaa aac acc gcc tac cgc agc 14966
Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser
350 355 360

tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc tcc 15014
Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser
365 370 375

tgg acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa gtc 15062
Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val
380 385 390 395

tac tgg tcg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc tcc 15110
Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser
400 405 410

acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc ctg ccc 15158
Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro
415 420 425

gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc tac tcg cag cag 15206
Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln
430 435 440

ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac cgc ttc ccc gag 15254
Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu
445 450 455

aac cag atc ctc gtc cgc ccg ccc gcg ccc acc att acc acc gtc agt 15302
Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser
460 465 470 475

gaa aac gtt cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc agc 15350
Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser
480 485 490

agt atc cgg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc cgc 15398
Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg
495 500 505

acc tgc ccc tac gtc tac aag gcc ctg ggc gta gtc gcg ccg cgc gtc 15446
Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val
510 515 520

ctc tcg agc cgc acc ttc taa aaaatgtcca ttctcatctc gcccagtaat 15497
Leu Ser Ser Arg Thr Phe
525

ccacgccgag cctgaagccg gtgaccctgc agcaggtgct accgagcgcg gcgccgcgcc 16577
ggggcttcaa gcgcgagggc ggcgaggatc tgtacccgac catgcagctg atggtgccca 16637 
agcgccagaa gctggaggac gtgctggagc acatgaaggt ggaccccgag gtgcagcccg 16697 
aggtcaaggt gcggcccatc aagcaggtgg ccccgggcct gggcgtgcag accgtggaca 16757 
tcaagatccc cacggagccc atggaaacgc agaccgagcc cgtgaagccc agcaccagca 16817 
ccatggaggt gcagacggat ccctggatgc cagcaccagc ttccaccagc actcgccgaa 16877 
gacgcaagta cggcgcggcc agcctgctga tgcccaacta cgcgctgcat ccttccatca 16937 
tccccacgcc gggctaccgc ggcacgcgct tctaccgcgg ctacaccagc agccgccgcc 16997 
gcaagaccac cacccgccgc cgtcgtcgca gccgccgcag cagcaccgcg acttccgcct 17057 
tggtgcggag agtgtatcgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17117 
acccgagcat cgccatttaa ctaccgcctc ctacttgcag atatggccct cacatgccgc 17177 
ctccgcgtcc ccattacggg ctaccgagga agaaagccgc gccgtagaag gctgacgggg 17237 
aacgggctgc gtcgccatca ccaccggcgg cggcgcgcca tcagcaagcg gttgggggga 17297 
ggcttcctgc ccgcgctgat ccccatcatc gccgcggcga tcggggcgat ccccggcata 17357 
gcttccgtgg cggtgcaggc ctctcagcgc cactgagaca caaaaaagca tggatttgta 17417 
ataaaaaaaa aaatggactg acgctcctgg tcctgtgatg tgtgttttta gatggaagac 17477 
atcaattttt cgtccctggc accgcgacac ggcacgcggc cgtttatggg cacctggagc 17537 
gacatcggca acagccaact gaacgggggc gccttcaatt ggagcagtct ctggagcggg 17597 
cttaagaatt tcgggtccac gctcaaaacc tatggcaaca aggcgtggaa cagcagcaca 17657 
gggcaggcgc tgagggaaaa gctgaaagaa cagaacttcc agcagaaggt ggttgatggc 17717 
ctggcctcag gcatcaacgg ggtggttgac ctggccaacc aggccgtgca gaaacagatc 17777 
aacagccgcc tggacgcggt cccgcccgcg gggtccgtgg agatgcccca ggtggaggag 17837 
gagctgcctc ccctggacaa gcgcggcgac aagcgaccgc gtcccgacgc ggaggagacg 17897 
ctgctgacgc acacggacga gccgcccccg tacgaggagg cggtgaaact gggcctgccc 17957 
accacgcggc ccgtggcgcc tctggccacc ggagtgctga aacccagcag cagccagccc 18017 
gcgaccctgg acttgcctcc gcctcgcccc tccacagtgg ctaagcccct gccgccggtg 18077 
gccgtcgcgt cgcgcgcccc ccgaggccgc ccccaggcga actggcagag cactctgaac 18137 
agcatcgtgg gtctgggagt gcagagtgtg aagcgccgcc gctgctatta aaagacactg 18197
tagcgcttaa cttgcttgtc tgtgtgtata tgtatgtccg ccgaccagaa ggaggagtgt 18257

gaagaggcgc gtcgccgagt tgcaag atg gcc acc cca tcg atg ctg ccc cag 18310
Met Ala Thr Pro Ser Met Leu Pro Gln
530 535

tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg agt 18358
Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser
540 545 550

ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt ctg 18406
Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu
555 560 565 570

ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg acc 18454
Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val Thr
575 580 585

acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc gtg gac cgc 18502
Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp Arg
590 595 600

gag gac aac acc tac tcg tac aaa gtg cgc tac acg ctg gcc gtg ggc 18550
Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly
605 610 615

gac aac cgc gtg ctg gac atg gcc agc acc tac ttt gac atc cgc ggc 18598
Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly
620 625 630

gtg ctg gac cgg ggc cct agc ttc aaa ccc tac tct ggc acc gcc tac 18646
Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr
635 640 645 650

aac agc cta gct ccc aag gga gct ccc aat tcc agc cag tgg gag caa 18694
Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Ser Ser Gln Trp Glu Gln
655 660 665

gca aaa aca ggc aat ggg gga act atg gaa aca cac aca tat ggt gtg 18742
Ala Lys Thr Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val
670 675 680

gcc cca atg ggc gga gag aat att aca aaa gat ggt ctt caa att gga 18790
Ala Pro Met Gly Gly Glu Asn Ile Thr Lys Asp Gly Leu Gln Ile Gly
685 690 695

act gac gtt aca gcg aat cag aat aaa cca att tat gcc gac aaa aca 18838
Thr Asp Val Thr Ala Asn Gln Asn Lys Pro Ile Tyr Ala Asp Lys Thr
700 705 710

ttt caa cca gaa ccg caa gta gga gaa gaa aat tgg caa gaa act gaa 18886
Phe Gln Pro Glu Pro Gln Val Gly Glu Glu Asn Trp Gln Glu Thr Glu
715 720 725 730

aac ttt tat ggc ggt aga gct ctt aaa aaa gac aca aac atg aaa cct 18934
Asn Phe Tyr Gly Gly Arg Ala Leu Lys Lys Asp Thr Asn Met Lys Pro
735 740 745

tgc tat ggc tcc tat gct aga ccc acc aat gaa aaa gga ggt caa gct 18982
Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala
750 755 760

aaa ctt aaa gtt gga gat gat gga gtt cca acc aaa gaa ttc gac ata 19030
Lys Leu Lys Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile
765 770 775

gac ctg gct ttc ttt gat act ccc ggt ggc acc gtg aac ggt caa gac 19078
Asp Leu Ala Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gln Asp
780 785 790

gag tat aaa gca gac att gtc atg tat acc gaa aac acg tat ttg gaa 19126
Glu Tyr Lys Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu
795 800 805 810

act cca gac acg cat gtg gta tac aaa cca ggc aag gat gat gca agt 19174
Thr Pro Asp Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser
815 820 825

tct gaa att aac ctg gtt cag cag tct atg ccc aac aga ccc aac tac 19222
Ser Glu Ile Asn Leu Val Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr
830 835 840

att ggg ttc agg gac aac ttt atc ggt ctt atg tac tac aac agc act 19270
Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr
845 850 855

ggc aat atg ggt gtg ctt gct ggt cag gcc tcc cag ctg aat gct gtg 19318
Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val
860 865 870

gtt gat ttg caa gac aga aac acc gag ctg tcc tac cag ctc ttg ctt 19366
Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu
875 880 885 890

gac tct ttg ggt gac aga acc cgg tat ttc agt atg tgg aac cag gcg 19414
Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala
895 900 905

gtg gac agt tat gac ccc gat gtg cgc atc atc gaa aac cat ggt gtg 19462
Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val
910 915 920

gag gat gaa ttg cca aac tat tgc ttc ccc ttg gac ggc tct ggc act 19510
Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr
925 930 935

aac gcc gca tac caa ggt gtg aaa gta aaa gat ggt caa gat ggt gat 19558
Asn Ala Ala Tyr Gln Gly Val Lys Val Lys Asp Gly Gln Asp Gly Asp
940 945 950

gtt gag agt gaa tgg gaa aat gac gat act gtt gca gct cga aat caa 19606
Val Glu Ser Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gln
955 960 965 970

tta tgt aaa ggt aac att ttc gcc atg gag att aat ctc cag gct aac 19654
Leu Cys Lys Gly Asn Ile Phe Ala Met Glu Ile Asn Leu Gln Ala Asn
975 980 985

ctg tgg aga agt ttc ctc tac tcg aac gtg gcc ctg tac ctg ccc gac 19702
Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp
990 995 1000



tcc tac aag tac acg ccg acc aac gtc acg ctg ccg acc aac acc 19747
Ser Tyr Lys Tyr Thr Pro Thr Asn Val Thr Leu Pro Thr Asn Thr
1005 1010 1015

aac acc tac gat tac atg aat ggc aga gtg aca cct ccc tcg ctg 19792
Asn Thr Tyr Asp Tyr Met Asn Gly Arg Val Thr Pro Pro Ser Leu
1020 1025 1030

gta gac gcc tac ctc aac atc ggg gcg cgc tgg tcg ctg gac ccc 19837
Val Asp Ala Tyr Leu Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro
1035 1040 1045

atg gac aac gtc aac ccc ttc aac cac cac cgc aac gcg ggc ctg 19882
Met Asp Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu
1050 1055 1060

cgc tac cgc tcc atg ctc ctg ggc aac ggg cgc tac gtg ccc ttc 19927
Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe
1065 1070 1075

cac atc cag gtg ccc caa aag ttt ttc gcc atc aag agc ctc ctg 19972
His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Ser Leu Leu
1080 1085 1090

ctc ctg ccc ggg tcc tac acc tac gag tgg aac ttc cgc aag gac 20017
Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp
1095 1100 1105

gtc aac atg atc ctg cag agc tcc cta ggc aac gac ctg cgc acg 20062
Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Thr
1110 1115 1120

gac ggg gcc tcc atc gcc ttc acc agc atc aac ctc tac gcc acc 20107
Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr
1125 1130 1135

ttc ttc ccc atg gcg cac aac acc gcc tcc acg ctc gag gcc atg 20152
Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met
1140 1145 1150

ctg cgc aac gac acc aac gac cag tcc ttc aac gac tac ctc tcg 20197
Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser
1155 1160 1165

gcg gcc aac atg ctc tac ccc atc ccg gcc aac gcc acc aac gtg 20242
Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val
1170 1175 1180

ccc atc tcc atc ccc tcg cgc aac tgg gcc gcc ttc cgc gga tgg 20287
Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp
1185 1190 1195

tcc ttc acg cgc ctg aag acc cgc gag acg ccc tcg ctc ggc tcc 20332
Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser
1200 1205 1210

ggg ttc gac ccc tac ttc gtc tac tcg ggc tcc atc ccc tac cta 20377
Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu
1215 1220 1225

gac ggc acc ttc tac ctc aac cac acc ttc aag aag gtc tcc atc 20422
Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile
1230 1235 1240

acc ttc gac tcc tcc gtc agc tgg ccc ggc aac gac cgc ctc ctg 20467
Thr Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu
1245 1250 1255

acg ccc aac gag ttc gaa atc aag cgc acc gtc gac gga gag gga 20512
Thr Pro Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly
1260 1265 1270

tac aac gtg gcc cag tgc aac atg acc aag gac tgg ttc ctg gtc 20557
Tyr Asn Val Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val
1275 1280 1285

cag atg ctg gcc cac tac aac atc ggc tac cag ggc ttc tac gtg 20602
Gln Met Leu Ala His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val
1290 1295 1300

ccc gag ggc tac aag gac cgc atg tac tcc ttc ttc cgc aac ttc 20647
Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe
1305 1310 1315

cag ccc atg agc cgc cag gtc gtg gac gag gtc aac tac aag gac 20692
Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn Tyr Lys Asp
1320 1325 1330

tac cag gcc gtc acc ctg gcc tac cag cac aac aac tcg ggc ttc 20737
Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser Gly Phe
1335 1340 1345

gtc ggc tac ctc gcg ccc acc atg cgc cag ggc cag ccc tac ccc 20782
Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr Pro
1350 1355 1360

gcc aac tac ccc tac ccg ctc atc ggc aag agc gcc gtc gcc agc 20827
Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser
1365 1370 1375

gtc acc cag aaa aag ttc ctc tgc gac cgg gtc atg tgg cgc atc 20872
Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile
1380 1385 1390

ccc ttc tcc agc aac ttc atg tcc atg ggc gcg ctc acc gac ctc 20917
Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu
1395 1400 1405

ggc cag aac atg ctc tac gcc aac tcc gcc cac gcg cta gac atg 20962
Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met
1410 1415 1420

aat ttc gaa gtc gac ccc atg gat gag tcc acc ctt ctc tat gtt 21007
Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val
1425 1430 1435

gtc ttc gaa gtc ttc gac gtc gtc cga gtg cac cag ccc cac cgc 21052
Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His Arg
1440 1445 1450

ggc gtc atc gaa gcc gtc tac ctg cgc acg ccc ttc tcg gcc ggc 21097
Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly
1455 1460 1465

aac gcc acc acc taa gccgctcttg cttcttgcaa gatgacggcg ggctccggcg 21152
Asn Ala Thr Thr
1470

agcaggagct cagggccatc ctccgcgacc tgggctgcgg gccctgcttc ctgggcacct 21212

tcgacaagcg cttccctgga ttcatggccc cgcacaagct ggcctgcgcc atcgtgaaca 21272

cggccggccg cgagaccggg ggcgagcact ggctggcctt cgcctggaac ccgcgctccc 21332

acacatgcta cctcttcgac cccttcgggt tctcggacga gcgcctcaag cagatctacc 21392

agttcgagta cgagggcctg ctgcgtcgca gcgccctggc caccgaggac cgctgcgtca 21452

ccctggaaaa gtccacccag accgtgcagg gtccgcgctc ggccgcctgc gggctcttct 21512

gctgcatgtt cctgcacgcc ttcgtgcact ggcccgaccg ccccatggac aagaacccca 21572

ccatgaactt actgacgggg gtgcccaacg gcatgctcca gtcgccccag gtggaaccca 21632

ccctgcgccg caaccaggaa gcgctctacc gcttcctcaa tgcccactcc gcctactttc 21692

gctcccaccg cgcgcgcatc gagaaggcca ccgccttcga ccgcatgaat caagacatgt 21752

aaaaaaccgg tgtgtgtatg tgaatgcttt attcataata aacagcacat gtttatgcca 21812

ccttctctga ggctctgact ttatttagaa atcgaagggg ttctgccggc tctcggcatg 21872

gcccgcgggc agggatacgt tgcggaactg gtacttgggc agccacttga actcggggat 21932

cagcagcttg ggcacgggga ggtcggggaa cgagtcgctc cacagcttgc gcgtgagttg 21992

cagggcgccc agcaggtcgg gcgcggagat cttgaaatcg cagttgggac ccgcgttctg 22052

cgcgcgagag ttgcggtaca cggggttgca gcactggaac accatcaggg ccgggtgctt 22112

cacgcttgcc agcaccgtcg cgtcggtgat gccctccacg tccagatcct cggcgttggc 22172
catcccgaag ggggtcatct tgcaggtctg ccgccccatg ctgggcacgc agccgggctt 22232
gtggttgcaa tcgcagtgca gggggatcag catcatctgg gcctgctcgg agctcatgcc 22292
cgggtacatg gccttcatga aagcctccag ctggcggaag gcctgctgcg ccttgccgcc 22352
ctcggtgaag aagaccccgc aggacttgct agagaactgg ttggtggcgc agccggcgtc 22412
gtgcacgcag cagcgcgcgt cgttgttggc cagctgcacc acgctgcgcc cccagcggtt 22472
ctgggtgatc ttggcccggt tggggttctc cttcagcgcg cgctgcccgt tctcgctcgc 22532
cacatccatc tcgatagtgt gctccttctg gatcatcacg gtcccgtgca ggcaccgcag 22592
cttgccctcg gcttcggtgc agccgtgcag ccacagcgcg cagccggtgc actcccagtt 22652
cttgtgggcg atctgggagt gcgagtgcac gaagccctgc aggaagcggc ccatcatcgc 22712
ggtcagggtc ttgttgctgg tgaaggtcag cgggatgccg cggtgctcct cgttcacata 22772
caggtggcag atgcggcggt acacctcgcc ctgctcgggc atcagctgga aggcggactt 22832
caggtcgctc tccacgcggt accggtccat cagcagcgtc atcacttcca tgcccttctc 22892
ccaggccgaa acgatcggca ggctcagggg gttcttcacc gccattgtca tcttagtcgc 22952
cgccgccgag gtcagggggt cgttctcgtc cagggtctca aacactcgct tgccgtcctt 23012
ctcgatgatg cgcacggggg gaaagctgaa gcccacggcc gccagctcct cctcggcctg 23072
cctttcgtcc tcgctgtcct ggctgatgtc ttgcaaaggc acatgcttgg tcttgcgggg 23132
tttctttttg ggcggcagag gcggcggcga tgtgctggga gagcgcgagt tctcgttcac 23192
cacgactatt tcttcttctt ggccgtcgtc cgagaccacg cggcggtagg catgcctctt 23252
ctggggcaga ggcggaggcg acgggctctc gcggttcggc gggcggctgg cagagcccct 23312
tccgcgttcg ggggtgcgct cctggcggcg ctgctctgac tgacttcctc cgcggccggc 23372
cattgtgttc tcctagggag caacaacaag catggagact cagccatcgt cgccaacatc 23432
gccatctgcc cccgccgcca ccgccgacga gaaccagcag cagaatgaaa gcttaaccgc 23492
cccgccgccc agccccacct ccgacgccgc ggccccagac atgcaagaga tggaggaatc 23552
catcgagatt gacctgggct acgtgacgcc cgcggagcac gaggaggagc tggcagcgcg 23612
cttttcagcc ccggaagaga accaccaaga gcagccagag caggaagcag agaacgagca 23672
gaaccaggct gggcacgagc atggcgacta cctgagcggg gcagaggacg tgctcatcaa 23732
gcatctggcc cgccaatgca tcatcgtcaa ggacgcgctg ctcgaccgcg ccgaggtgcc 23792
cctcagcgtg gcggagctca gccgcgccta cgagcgcaac ctcttctcgc cgcgcgtgcc 23852
ccccaagcgc cagcccaacg gcacctgtga gcccaacccg cgcctcaact tctacccggt 23912
cttcgcggtg cccgaggccc tggccaccta ccacctcttt ttcaagaacc aaaggatccc 23972
cgtctcctgc cgcgccaacc gcacccgcgc cgacgccctg ctcaacctgg gccccggcgc 24032
ccgcctacct gatatcacct ccttggaaga ggttcccaag atcttcgagg gtctgggcag 24092
cgacgagact cgggccgcga acgctctgca aggaagcgga gaggagcatg agcaccacag 24152
cgccctggtg gagttggaag gcgacaacgc gcgcctggcg gtcctcaagc gcacggtcga 24212
gctgacccac ttcgcctacc cggcgctcaa cctgcccccc aaggtcatga gcgccgtcat 24272
ggaccaggtg ctcatcaagc gcgcctcgcc cctctcggag gaggagatgc aggaccccga 24332
gagttcggac gagggcaagc ccgtggtcag cgacgagcag ctggcgcgct ggctgggagc 24392
gagtagcacc ccccagagcc tggaagagcg gcgcaagctc atgatggccg tggtcctggt 24452
gaccgtggag ctggagtgtc tgcgccgctt ctttgccgac gcggagaccc tgcgcaaggt 24512
cgaggagaac ctgcactacc tcttcaggca cgggttcgtg cgccaggcct gcaagatctc 24572
caacgtggag ctgaccaacc tggtctccta catgggcatc ctgcacgaga accgcctggg 24632
gcaaaacgtg ctgcacacca ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg 24692
cgtctacctg tacctctgcc acacctggca gacgggcatg ggcgtgtggc agcagtgcct 24752
ggaggagcag aacctgaaag agctctgcaa gctcctgcag aagaacctca aggccctgtg 24812
gaccgggttc gacgagcgta ccaccgcctc ggacctggcc gacctcatct tccccgagcg 24872
cctgcggctg acgctgcgca acgggctgcc cgactttatg agccaaagca tgttgcaaaa 24932
ctttcgctct ttcatcctcg aacgctccgg gatcctgccc gccacctgct ccgcgctgcc 24992
ctcggacttc gtgccgctga ccttccgcga gtgccccccg ccgctctgga gccactgcta 25052
cttgctgcgc ctggccaact acctggccta ccactcggac gtgatcgagg acgtcagcgg 25112
cgagggtctg ctggagtgcc actgccgctg caacctctgc acgccgcacc gctccctggc 25172
ctgcaacccc cagctgctga gcgagaccca gatcatcggc accttcgagt tgcaaggccc 25232
cggcgacggc gagggcaagg ggggtctgaa actcaccccg gggctgtgga cctcggccta 25292
cttgcgcaag ttcgtgcccg aggactacca tcccttcgag atcaggttct acgaggacca 25352
atcccagccg cccaaggccg agctgtcggc ctgcgtcatc acccaggggg ccatcctggc 25412
ccaattgcaa gccatccaga aatcccgcca agaatttctg ctgaaaaagg gccacggggt 25472
ctacttggac ccccagaccg gagaggagct caaccccagc ttcccccagg atgccccgag 25532
gaagcagcaa gaagctgaaa gtggagctgc cgccgccgga ggatttggag gaagactggg 25592
agagcagtca ggcagaggag gaggagatgg aagactggga cagcactcag gcagaggagg 25652
acagcctgca agacagtctg gaggaggaag acgaggtgga ggaggcagag gaagaagcag 25712
ccgccgccag accgtcgtcc tcggcggaga aagcaagcag cacggatacc atctccgctc 25772
cgggtcgggg tcgcggcggc cgggcccaca gtaggtggga cgagaccggg cgcttcccga 25832
accccaccac ccagaccggt aagaaggagc ggcagggata caagtcctgg cgggggcaca 25892
aaaacgccat cgtctcctgc ttgcaagcct gcgggggcaa catctccttc acccggcgct 25952
acctgctctt ccaccgcggg gtgaacttcc cccgcaacat cttgcattac taccgtcacc 26012
tccacagccc ctactactgt ttccaagaag aggcagaaac ccagcagcag cagaaaacca 26072
gcggcagcag cagctagaaa atccacagcg gcggcaggtg gactgaggat cgcggcgaac 26132
gagccggcgc agacccggga gctgaggaac cggatctttc ccaccctcta tgccatcttc 26192
cagcagagtc gggggcagga gcaggaactg aaagtcaaga accgttctct gcgctcgctc 26252
acccgcagtt gtctgtatca caagagcgaa gaccaacttc agcgcactct cgaggacgcc 26312
gaggctctct tcaacaagta ctgcgcgctc actcttaaag agtagcccgc gcccgcccac 26372
acacggaaaa aggcgggaat tacgtcacca cctgcgccct tcgcccgacc atcatgagca 26432
aagagattcc cacgccttac atgtggagct accagcccca gatgggcctg gccgccggcg 26492
ccgcccagga ctactccacc cgcatgaact ggctcagtgc cgggcccgcg atgatctcac 26552
gggtgaatga catccgcgcc caccgaaacc agatactcct agaacagtca gcgatcaccg 26612
ccacgccccg ccatcacctt aatccgcgta attggcccgc cgccctggtg taccaggaaa 26672
ttccccagcc cacgaccgta ctacttccgc gagacgccca ggccgaagtc cagctgacta 26732
actcaggtgt ccagctggcc ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta 26792
taaagcggct ggtgatccga ggcagaggca cacagctcaa cgacgaggtg gtgagctctt 26852
cgctgggtct gcgacctgac ggagtcttcc aactcgccgg atcggggaga tcttccttca 26912
cgcctcgtca ggccgtcctg actttggaga gttcgtcctc gcagccccgc tcgggcggca 26972
tcggcactct ccagttcgtg gaggagttca ctccctcggt ctacttcaac cccttctccg 27032
gctcccccgg ccactacccg gacgagttca tcccgaactt cgacgccatc agcgagtcgg 27092
tggacggcta cgattgaatg tcccatggtg gcgcagctga cctagctcgg cttcgacacc 27152
tggaccactg ccgccgcttc cgctgcttcg ctcgggatct cgccgagttt gcctactttg 27212
agctgcccga ggagcaccct cagggcccag cccacggagt gcggatcatc gtcgaagggg 27272
gcctcgactc ccacctgctt cggatcttca gccagcgacc gatcctggtc gagcgcgaac 27332

aaggacagac ccttcttact ttgtactgca tctgcaacca ccccggcctg catgaaagtc 27392 

tttgttgtct gctgtgtact gagtataata aaagctgaga tcagcgacta ctccggactc 27452 

gattgtggtg ttcctgctat caaccggtcc ctgttcttca ccgggaacga gaccgagctc 27512 

cagctccagt gtaagcccca caagaagtac ctcacctggc tgttccaggg ctccccgatc 27572 

gccgttgtca accactgcga caacgacgga gtcctgctga gcggccctgc caaccttact 27632 

ttttccaccc gcagaagcaa gctccagctc ttccaaccct tcctccccgg gacctatcag 27692 

tgcgtctcag gaccctgcca tcacaccttc cacctgatcc cgaataccac agcgccgctc 27752 

cccgctacta acaaccaaac tacccaccaa cgccaccgtc gcgacctttc ctctgaatct 27812 

aataccacta ccggaggtga gctccgaggt cgaccaacct ctgggattta ctacggcccc 27872 

tgggaggtgg tggggttaat agcgctaggc ctagttgcgg gtgggctttt ggttctctgc 27932 

tacctatacc tcccttgctg ttcgtactta gtggtgctgt gttgctggtt taagaaatgg 27992 

ggaagatcac cctagtgagc tgcggtgcgc tggtggcggt gttgctttcg attgtgggac 28052 

tgggcggcgc ggctgtagtg aaggagaagg ccgatccctg cttgcatttc aatcccaaca 28112

aatgccagct gagttttcag cccgatggca atcggtgcgc ggtactgatc aagtgcggat 28172 

gggaatgcga gaacgtgaga atcgagtaca ataacaagac tcggaacaat actctcgcgt 28232 

ccgtgtggca gcccggggac cccgagtggt acaccgtctc tgtccccggt gctgacggct 28292 

ccccgcgcac cgtgaataat actttcattt ttgcgcacat gtgcaacacg gtcatgtgga 28352 

tgagcaagca gtacgatatg tggcccccca cgaaggagaa catcgtggtc ttctccatcg 28412 

cttacagcct gtgcacggcg ctaatcaccg ctatcgtgtg cctgagcatt cacatgctca 28472 

tcgctattcg ccccagaaat aatgccgaga aagagaaaca gccataacac gttttttcac 28532 

acaccttgtt tttacagaca atgcgtctgt taaatttttt aaacattgtg ctcagtattg 28592 

cttatgcctc tggttatgca aacatacaga aaacccttta tgtaggatct gatggtacac 28652 

tagagggtac ccaatcacaa gccaaggttg catggtattt ttatagaacc aacactgatc 28712 

cagttaaact ttgtaagggt gaattgccgc gtacacataa aactccactt acatttagtt 28772 

gcagcaataa taatcttaca cttttttcaa ttacaaaaca atatactggt acttattaca 28832 

gtacaaactt tcatacagga caagataaat attatactgt taaggtagaa aatcctacca 28892 

ctcctagaac taccaccacc accactactg caaagcccac tgtgaaaact acaactagga 28952 

ccaccacaac tacagaaacc accaccagca caacacttgc tgcaactaca cacacacaca 29012

ctaagctaac cttacagacc actaatgatt tgatcgccct gctgcaaaag ggggataaca 29072

gcaccacttc caatgaggag atacccaaat ccatgattgg cattattgtt gctgtagtgg 29132

tgtgcatgtt gatcatcgcc ttgtgcatgg tgtactatgc cttctgctac agaaagcaca 29192

gactgaacga caagctggaa cacttactaa gtgttgaatt ttaatttttt agaaccatga 29252

agatcctagg cctttttagt ttttctatca ttacctctgc tctttgtgaa tcagtggata 29312

gagatgttac tattaccact ggttctaatt atacactgaa agggccaccc tcaggtatgc 29372

tttcgtggta ttgctatttt ggaactgaca ctgatcaaac tgaattatgc aattttcaaa 29432

aaggcaaaac ctcaaactct aaaatctcta attatcaatg caatggcact gatctgatac 29492

tactcaatgt cacgaaagca tatggtggca gttattattg ccctggacaa aacactgaag 29552

aaatgatttt ttacaaagtg gaagtggttg atcccactac accacccacc accacaacta 29612

ttcataccac acacacagaa caaacaccag aggcaacaga agcagagttg gccttccagg 29672

ttcacggaga ttcctttgct gtcaataccc ctacacccga tcagcggtgt ccggggccgc 29732

tagtcagcgg cattgtcggt gtgctttcgg gattagcagt cataatcatc tgcatgttca 29792

tttttgcttg ctgctataga aggctttacc gacaaaaatc agacccactg ctgaacctct 29852

atgtttaatt ttttccagag ccatgaaggc agttagcgct ctagtttttt gttctttgat 29912

tggcattgtt tttaatagta aaattaccag agttagcttt attaaacatg ttaatgtaac 29972

tgaaggagat aacatcacac tagcaggtgt agaaggtgct caaaacacca cctggacaaa 30032

ataccatcta ggatggagag atatttgcac ctggaatgta acttattatt gcataggagt 30092

taatcttacc attgttaacg ctaaccaatc tcagaatggg ttaattaaag gacagagtgt 30152

tagtgtgacc agtgatgggt actataccca gcatagtttt aactacaaca ttactgtcat 30212

accactgcct acgcctagcc cacctagcac taccacacag acaaccacat acagtacatc 30272

aaatcagcct accaccacta cagcagcaga ggttgccagc tcgtctgggg tccgagtggc 30332

atttttgatg ttggccccat ctagcagtcc cactgctagt accaatgagc agactactga 30392

atttttgtcc actgtcgaga gccacaccac agctacctcc agtgccttct ctagcaccgc 30452

caatctctcc tcgctttcct ctacaccaat cagccccgct actactccta gccccgctcc 30512

tcttcccact cccctgaagc aaacagacgg cggcatgcaa tggcagatca ccctgctcat 30572

tgtgatcggg ttggtcatcc tggccgtgtt gctctactac atcttctgcc gccgcattcc 30632

caacgcgcac cgcaagccgg cctacaagcc catcgttatc gggcagccgg agccgcttca 30692

ggtggaaggg ggtctaagga atcttctctt ctcttttaca gtatggtgat tgaactatga 30752


ttcctagaca attcttgatc actattctta tctgcctcct ccaagtctgt gccaccctcg 30812

ctctggtggc caacgccagt ccagactgta ttgggccctt cgcctcctac gtgctctttg 30872

ccttcgtcac ctgcatctgc tgctgtagca tagtctgcct gcttatcacc ttcttccagt 30932

tcattgactg gatctttgtg cgcatcgcct acctgcgcca ccacccccag taccgcgacc 30992

agcgagtggc gcagctgctc aggctcctct gataagcatg cgggctctgc tacttctcgc 31052

gcttctgctg ttagtgctcc cccgtcccgt cgacccccgg tcccccactc agtcccccga 31112

ggaggttcgc aaatgcaaat tccaagaacc ctggaaattc ctcaaatgct accgccaaaa 31172

atcagacatg catcccagct ggatcatgat cattgggatc gtgaacattc tggcctgcac 31232

cctcatctcc tttgtgattt acccctgctt tgactttggt tggaactcgc cagaggcgct 31292

ctatctcccg cctgaacctg acacaccacc acagcagcaa cctcaggcac acgcactacc 31352

accaccacag cctaggccac aatacatgcc catattagac tatgaggccg agccacagcg 31412

acccatgctc cccgctatta gttacttcaa tctaaccggc ggagatgact gacccactgg 31472

ccaataacaa cgtcaacgac cttctcctgg acatggacgg ccgcgcctcg gagcagcgac 31532

tcgcccaact tcgcattcgt cagcagcagg agagagccgt caaggagctg caggacggca 31592

tagccatcca ccagtgcaag agaggcatct tctgcctggt gaaacaggcc aagatctcct 31652

acgaggtcac ccagaccgac catcgcctct cctacgagct cctgcagcag cgccagaagt 31712

tcacctgcct ggtcggagtc aaccccatcg tcatcaccca gcagtcgggc gataccaagg 31772

ggtgcatcca ctgctcctgc gactcccccg actgcgtcca cactctgatc aagaccctct 31832

gcggcctccg cgacctcctc cccatgaact aatcaccccc ttatccagtg aaataaagat 31892

catattgatg atgatttaaa taaaaaaaat aatcatttga tttgaaataa agatacaatc 31952

atattgatga tttgagttta acaaaaataa agaatcactt acttgaaatc tgataccagg 32012

tctctgtcca tgttttctgc caacaccacc tcactcccct cttcccagct ctggtactgc 32072

aggccccggc gggctgcaaa cttcctccac acgctgaagg ggatgtcaaa ttcctcctgt 32132

ccctcaatct tcattttatc ttctatcag atg tcc aaa aag cgc gtc cgg gtg 32185
Met Ser Lys Lys Arg Val Arg Val
1475

gat gat gac ttc gac ccc gtc tac ccc tac gat gca gac aac gca 32230
Asp Asp Asp Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala
1480 1485 1490

ccg acc gtg ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga 32275
Pro Thr Val Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly
1495 1500 1505

ttc caa gag aag ccc ctg ggg gtg ttg tcc ctg cga ctg gct gac 32320
Phe Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp
1510 1515 1520

ccc gtc acc acc aag aac ggg gaa atc acc ctc aag ctg gga gag 32365
Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu
1525 1530 1535

ggg gtg gac ctc gac tcg tcg gga aaa ctc atc tcc aac acg gcc 32410
Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala
1540 1545 1550

acc aag gcc gcc gcc cct ctc agt att tca aac aac acc att tcc 32455
Thr Lys Ala Ala Ala Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser
1555 1560 1565

ctt aaa act gct gcc cct ttc tac aac aac aat gga act tta agc 32500
Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu Ser
1570 1575 1580

ctc aat gtc tcc aca cca tta gca gta ttt ccc aca ttt aac act 32545
Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr
1585 1590 1595

tta ggc ata agt ctt gga aac ggt ctt cag act tca aat aag ttg 32590
Leu Gly Ile Ser Leu Gly Asn Gly Leu Gln Thr Ser Asn Lys Leu
1600 1605 1610

ttg act gta caa cta act cat cct ctt aca ttc agc tca aat agc 32635
Leu Thr Val Gln Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser
1615 1620 1625

atc aca gta aaa aca gac aaa ggg cta tat att aac tcc agt gga 32680
Ile Thr Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly
1630 1635 1640

aac aga gga ctt gag gct aat ata agc cta aaa aga gga cta gtt 32725
Asn Arg Gly Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Val
1645 1650 1655

ttt gac ggt aat gct att gca aca tat att gga aat ggc tta gac 32770
Phe Asp Gly Asn Ala Ile Ala Thr Tyr Ile Gly Asn Gly Leu Asp
1660 1665 1670

tat gga tct tat gat agt gat gga aaa aca aga ccc gta att acc 32815
Tyr Gly Ser Tyr Asp Ser Asp Gly Lys Thr Arg Pro Val Ile Thr
1675 1680 1685

aaa att gga gca gga tta aat ttt gat gct aac aaa gca ata gct 32860
Lys Ile Gly Ala Gly Leu Asn Phe Asp Ala Asn Lys Ala Ile Ala
1690 1695 1700






gtc aaa cta ggc aca ggt tta agt ttt gac tcc gct ggt gcc ttg 32905
Val Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser Ala Gly Ala Leu
1705 1710 1715

aca gct gga aac aaa cag gat gac aag cta aca ctt tgg act acc 32950
Thr Ala Gly Asn Lys Gln Asp Asp Lys Leu Thr Leu Trp Thr Thr
1720 1725 1730

cct gac cca agc cct aat tgt caa tta ctt tca gac aga gat gcc 32995
Pro Asp Pro Ser Pro Asn Cys Gln Leu Leu Ser Asp Arg Asp Ala
1735 1740 1745

aaa ttt act ctc tgt ctt aca aaa tgc ggt agt caa ata cta ggc 33040
Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly
1750 1755 1760

act gtg gca gtg gcg gct gtt act gta gga tca gca cta aat cca 33085
Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu Asn Pro
1765 1770 1775

att aat gac aca gtc aaa agc gcc ata gtt ttc ctt aga ttt gat 33130
Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe Asp
1780 1785 1790

tcc gat ggt gta ctc atg tca aac tca tca atg gta ggt gat tac 33175
Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr
1795 1800 1805

tgg aac ttt agg gag gga cag acc act caa agt gta gcc tat aca 33220
Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr Thr
1810 1815 1820

aat gct gtg gga ttc atg cca aat ata ggt gca tat cca aaa acc 33265
Asn Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr
1825 1830 1835

caa agt aaa aca cct aaa aat agc ata gtc agt cag gta tat tta 33310
Gln Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu
1840 1845 1850

act gga gaa act act atg cca atg aca cta acc ata act ttc aat 33355
Thr Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn
1855 1860 1865

ggc act gat gaa aaa gac aca acc cca gtt agc acc tac tct atg 33400
Gly Thr Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met
1870 1875 1880

act ttt aca tgg cag tgg act gga gac tat aag gac aaa aat att 33445
Thr Phe Thr Trp Gln Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile
1885 1890 1895

acc ttt gct acc aac tca ttc tct ttt tcc tac atc gcc cag gaa 33490
Thr Phe Ala Thr Asn Ser Phe Ser Phe Ser Tyr Ile Ala Gln Glu
1900 1905 1910

taa tcccacccag caagccaacc ccttttccca ccacctttgt ctatatggaa 33543

actctgaaac agaaaaataa agttcaagtg ttttattgaa tcaacagttt tacaggactc 33603

gagcagttat ttttcctcca ccctcccagg acatggaata caccaccctc tccccccgca 33663

cagccttgaa catctgaatg ccattggtga tggacatgct tttggtctcc acgttccaca 33723

cagtttcaga gcgagccagt ctcggatcgg tcagggagat gaaaccctcc gggcactccc 33783

gcatctgcac ctcacagctc aacagctgag gattgtcctc ggtggtcggg atcacggtta 33843

tctggaagaa gcagaagagc ggcggtggga atcatagtcc gcgaacggga tcggccggtg 33903

gtgtcgcatc aggccccgca gcagtcgctg ccgccgccgc tccgtcaagc tgctgctcag 33963

ggggttcggg tccagggact ccctcagcat gatgcccacg gccctcagca tcagtcgtct 34023

ggtgcggcgg gcgcagcagc gcatgcgaat ctcgctcagg tcactgcagt acgtgcaaca 34083

caggaccacc aggttgttca acagtccata gttcaacacg ctccagccga aactcatcgc 34143

gggaaggatg ctacccacgt ggccgtcgta ccagatcctc aggtaaatca agtggcgctc 34203

cctccagaag acgctgccca tgtacatgat ctccttgggc atgtggcggt tcaccacctc 34263

ccggtaccac atcaccctct ggttgaacat gcagccccgg atgatcctgc ggaaccacag 34323

ggccagcacc gccccgcccg ccatgcagcg aagagacccc ggatcccggc aatgacaatg 34383

gaggacccac cgctcgtacc cgtggatcat ctgggagctg aacaagtcta tgttggcaca 34443

gcacaggcat atgctcatgc atctcttcag cactctcagc tcctcggggg tcaaaaccat 34503

atcccagggc acggggaact cttgcaggac agcgaacccc gcagaacagg gcaatcctcg 34563

cacataactt acattgtgca tggacagggt atcgcaatca ggcagcaccg ggtgatcctc 34623

caccagagaa gcgcgggtct cggtctcctc acagcgtggt aagggggccg gccgatacgg 34683

gtgatggcgg gacgcggctg atcgtgttct cgaccgtgtc atgatgcagt tgctttcgga 34743

cattttcgta cttgctgtag cagaacctgg tccgggcgct gcacaccgat cgccggcggc 34803

ggtctcggcg cttggaacgc tcggtgttaa agttgtaaaa cagccactct ctcagaccgt 34863

gcagcagatc tagggcctca ggagtgatga agatcccatc atgcctgata gctctgatca 34923

catcgaccac cgtggaatgg gccaggccca gccagatgat gcaattttgt tgggtttcgg 34983

tgacggcggg ggagggaaga acaggaagaa ccatgattaa cttttaatcc aaacggtctc 35043

ggagcacttc aaaatgaagg tcacggagat ggcacctctc gcccccgctg tgttggtgga 35103

aaataacagc caggtcaaag gtgatacggt tctcgagatg ttccacggtg gcttccagca 35163

aagcctccac gcgcacatcc agaaacaaga caatagcgaa agcgggaggg ttctctaatt 35223

cctcaaccat catgttacac tcctgcacca tccccagata attttcattt ttccagcctt 35283


gaatgattcg aactagttcc tgaggtaaat ccaagccagc catgataaaa agctcgcgca 35343

gagcaccctc caccggcatt cttaagcaca ccctcataat tccaagatat tctgctcctg 35403

gttcacctgc agcagattga caagcggaat atcaaaatct ctgccgcgat ccctgagctc 35463

ctccctcagc aataactgta agtactcttt catatcgtct ccgaaatttt tagccatagg 35523

acccccagga ataagagaag ggcaagccac attacagata aaccgaagtc ccccccagtg 35583

agcattgcca aatgtaagat tgaaataagc atgctggcta gacccggtga tatcttccag 35643

ataactggac agaaaatcgg gtaagcaatt tttaagaaaa tcaacaaaag aaaaatcttc 35703

caggtgcacg tttagggcct cgggaacaac gatggagtaa gtgcaagggg tgcgttccag 35763

catggttagt tagctgatct gtaaaaaaac aaaaaataaa acattaaacc atgctagcct 35823

ggcgaacagg tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc 35883

gaccctcgta aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc 35943

cggcgtgaat gattcgagaa gaagcataca cccccggaac attggagtcc gtgagtgaaa 36003

aaaagcggcc gaggaagcaa tgaggcacta caacgctcac tctcaagtcc agcaaagcga 36063

tgccatgcgg atgaagcaca aaattttcag gtgcgtaaaa aatgtaatta ctcccctcct 36123

gcacaggcag cgaagctccc gatccctcca gatacacata caaagcctca gcgtccatag 36183

cttaccgagc ggcagcagca gcggcacaca acaggcgcaa gagtcagaga aaagactgag 36243

ctctaacctg tccgcccgct ctctgctcaa tatatagccc cagatctaca ctgacgtaaa 36303

ggccaaagtc taaaaatacc cgccaaataa tcacacacgc ccagcacacg cccagaaacc 36363

ggtgacacac tcagaaaaat acgcgcactt cctcaaacgg ccaaactgcc gtcatttccg 36423

ggttcccacg ctacgtcatc aaaacacgac tttcaaattc cgtcgaccgt taaaaacatc 36483

acccgccccg cccctaacgg tcgccgctcc cgcagccaat caccttcctc cctccccaaa 36543

ttcaaacagc tcatttgcat attaacgcgc accaaaagtt tgaggtatat tattgatgat 36603

g 36604



<210> 6 

<211> 529 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
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<400> 6 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
1 5 10 15 

Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser
35 40 45

Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60

Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80

Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95

Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110

Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser
130 135 140

Arg Lys Thr Pro Asn Gly Val Asp Asp Asp Tyr Asp Gly Ser Gln Asp
145 150 155 160

Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly Asn Phe
165 170 175

Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile Asp Asn
180 185 190

Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly
195 200 205

Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr
210 215 220

Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp
225 230 235 240

Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser Arg Leu
245 250 255

Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu Gly Phe
260 265 270

Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu
275 280 285 
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Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Asp Ser Thr Ala Ala Ala 
290 295 300 

Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp Asn Phe 
305 310 315 320 

Ala Ser Ala Ala Ala Ala Ala Glu Ala Ala Glu Thr Glu Ser Lys Ile
325 330 335

Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr Asn Val
340 345 350

Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu Ala Tyr
355 360 365

Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu Leu Thr
370 375 380

Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser Leu Pro
385 390 395 400

Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln Val Ser
405 410 415

Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser Lys Ser
420 425 430

Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala Phe Thr
435 440 445

Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val
450 455 460

Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala
465 470 475 480

Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg Gly Val
485 490 495

Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val
500 505 510 

Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser Arg Thr 
515 520 525 

Phe 



<210> 7 
<211> 942 
<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 
<400> 7 

Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr
65 70 75 80

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125

Ala Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys Thr Gly Asn Gly Gly
130 135 140

Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro Met Gly Gly Glu Asn
145 150 155 160

Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp Val Thr Ala Asn Gln
165 170 175

Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln Pro Glu Pro Gln Val
180 185 190

Gly Glu Glu Asn Trp Gln Glu Thr Glu Asn Phe Tyr Gly Gly Arg Ala
195 200 205

Leu Lys Lys Asp Thr Asn Met Lys Pro Cys Tyr Gly Ser Tyr Ala Arg
210 215 220

Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys Val Gly Asp Asp
225 230 235 240

Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala Phe Phe Asp Thr
245 250 255

Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys Ala Asp Ile Val
260 265 270

Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp Thr His Val Val
275 280 285

Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile Asn Leu Val Gln
290 295 300

Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe
305 310 315 320

Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala
325 330 335

Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn
340 345 350

Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr
355 360 365

Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp
370 375 380

Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr
385 390 395 400

Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala Tyr Gln Gly Val
405 410 415

Lys Val Lys Asp Gly Gln Asp Gly Asp Val Glu Ser Glu Trp Glu Asn
420 425 430

Asp Asp Thr Val Ala Ala Arg Asn Gln Leu Cys Lys Gly Asn Ile Phe
435 440 445

Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Ser Phe Leu Tyr
450 455 460

Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Thr
465 470 475 480

Asn Val Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly
485 490 495

Arg Val Thr Pro Pro Ser Leu Val Asp Ala Tyr Leu Asn Ile Gly Ala
500 505 510

Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His His
515 520 525

Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg
530 535 540

Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys
545 550 555 560

Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg
565 570 575

Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg
580 585 590

Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser Ile Asn Leu Tyr Ala Thr
595 600 605

Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu
610 615 620

Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala
625 630 635 640

Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser
645 650 655

Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg
660 665 670

Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr
675 680 685

Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu
690 695 700

Asn His Thr Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser
705 710 715 720

Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys
725 730 735

Arg Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr
740 745 750

Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly Tyr
755 760 765

Gln Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr Ser Phe

770 775 780

Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp Glu Val Asn
785 790 795 800

Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln His Asn Asn Ser
805 810 815

Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg Gln Gly Gln Pro Tyr
820 825 830

Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Lys Ser Ala Val Ala Ser
835 840 845

Val Thr Gln Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro
850 855 860

Phe Ser Ser Asn Phe Met Ser Met Gly Ala Leu Thr Asp Leu Gly Gln
865 870 875 880

Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Asn Phe Glu
885 890 895

Val Asp Pro Met Asp Glu Ser Thr Leu Leu Tyr Val Val Phe Glu Val
900 905 910

Phe Asp Val Val Arg Val His Gln Pro His Arg Gly Val Ile Glu Ala
915 920 925 
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Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
930 935 940 



<210> 8 

<211> 443 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan6 

<400> 8 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr
1 5 10 15

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30

Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Ile Ser Asn Asn Thr
85 90 95

Ile Ser Leu Lys Thr Ala Ala Pro Phe Tyr Asn Asn Asn Gly Thr Leu
100 105 110

Ser Leu Asn Val Ser Thr Pro Leu Ala Val Phe Pro Thr Phe Asn Thr
115 120 125

Leu Gly Ile Ser Leu Gly Asn Gly Leu Gln Thr Ser Asn Lys Leu Leu
130 135 140

Thr Val Gln Leu Thr His Pro Leu Thr Phe Ser Ser Asn Ser Ile Thr
145 150 155 160

Val Lys Thr Asp Lys Gly Leu Tyr Ile Asn Ser Ser Gly Asn Arg Gly
165 170 175

Leu Glu Ala Asn Ile Ser Leu Lys Arg Gly Leu Val Phe Asp Gly Asn
180 185 190

Ala Ile Ala Thr Tyr Ile Gly Asn Gly Leu Asp Tyr Gly Ser Tyr Asp
195 200 205

Ser Asp Gly Lys Thr Arg Pro Val Ile Thr Lys Ile Gly Ala Gly Leu
210 215 220

Asn Phe Asp Ala Asn Lys Ala Ile Ala Val Lys Leu Gly Thr Gly Leu
225 230 235 240

Ser Phe Asp Ser Ala Gly Ala Leu Thr Ala Gly Asn Lys Gln Asp Asp
245 250 255

Lys Leu Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Leu
260 265 270

Leu Ser Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly
275 280 285

Ser Gln Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser
290 295 300

Ala Leu Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu
305 310 315 320

Arg Phe Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly
325 330 335

Asp Tyr Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr
340 345 350

Thr Asn Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr
355 360 365

Gln Ser Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu Thr
370 375 380

Gly Glu Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr
385 390 395 400

Asp Glu Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr
405 410 415

Trp Gln Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr
420 425 430

Asn Ser Phe Ser Phe Ser Tyr Ile Ala Gln Glu
435 440 



<210> 9 

<211> 36535 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan7 
<220> 

<221> CDS 

<222> (13874) . . (15469) 

<223> L2 Penton 

<220> 

<221> CDS 

<222> (18288) . . (21086) 

<223> L3 Hexon 
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<220> 

<221> CDS 

<222> (32094) . . (33425) 

<223> L5 Fiber 

<400> 9 



catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60

atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120

tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300

aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660

gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960

aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320

agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380

ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440

ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500
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3060 


cccgcaagac atggcccgag ttcgagcaca acgtcatgac ccgatgcaat gtgcacctgg 3120

ggtcccgccg aggcatgttc atgccctacc agtgcaacat gcaatttgtg aaggtgctgc 3180

tggagcccga tgccatgtcc agagtgagcc tgacgggggt gtttgacatg aatgtggagc 3240

tgtggaaaat tctgagatat gatgaatcca agaccaggtg ccgggcctgc gaatgcggag 3300

gcaagcacgc caggcttcag cccgtgtgtg tggaggtgac ggaggacctg cgacccgatc 3360

atttggtgtt gtcctgcaac gggacggagt tcggctccag cggggaagaa tctgactaga 3420

gtgagtagtg tttgggggag gtggagggct tgtatgaggg gcagaatgac taaaatctgt 3480

gtttttctgt gtgttgcagc agcatgagcg gaagcgcctc ctttgaggga ggggtattca 3540

gcccttatct gacggggcgt ctcccctcct gggcgggagt gcgtcagaat gtgatgggat 3600

ccacggtgga cggccggccc gtgcagcccg cgaactcttc aaccctgacc tacgcgaccc 3660

tgagctcctc gtccgtggac gcagctgccg ccgcagctgc tgcttccgcc gccagcgccg 3720

tgcgcggaat ggccctgggc gccggctact acagctctct ggtggccaac tcgacttcca 3780

ccaataatcc cgccagcctg aacgaggaga agctgctgct gctgatggcc cagctcgagg 3840

ccctgaccca gcgcctgggc gagctgaccc agcaggtggc tcagctgcag gcggagacgc 3900

gggccgcggt tgccacggtg aaaaccaaat aaaaaatgaa tcaataaata aacggagacg 3960

gttgttgatt ttaacacaga gtcttgaatc tttatttgat ttttcgcgcg cggtaggccc 4020

tggaccaccg gtctcgatca ttgagcaccc ggtggatttt ttccaggacc cggtagaggt 4080

gggcttggat gttgaggtac atgggcatga gcccgtcccg ggggtggagg tagctccatt 4140

gcagggcctc gtgctcgggg gtggtgttgt aaatcaccca gtcatagcag gggcgcaggg 4200

cgtggtgctg cacgatgtcc ttgaggagga gactgatggc cacgggcagc cccttggtgt 4260

aggtgttgac gaacctgttg
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atggtacttg atctccacgg cgccgttggt ggcgacgtcc acggcttgca gggtcccgtg 8400 

cccctggggc gccaccaccg tgccccgttt cttcttgggc gctgcttcca tgccggtcag 8460 

aagcggcggc gaggacgcgc gccgggcggc aggggcggct cgggacccgg aggcaggggc 8520

ggcaggggca cgtcggcgcc gcgcgcgggc aggttctggt actgcgcccg gagaagactg 8580

gcgtgagcga cgacgcgacg gttgacgtcc tggatctgac gcctctgggt gaaggccacg 8640 

ggacccgtga gtttgaacct gaaagagagt tcgacagaat caatctcggt atcgttgacg 8700 

gcggcctgcc gcaggatctc ttgcacgtcg cccgagttgt cctggtaggc gatctcggtc 8760 

atgaactgct cgatctcctc ctcctgaagg tctccgcggc cggcgcgctc gacggtggcc 8820 

gcgaggtcgt tggagatgcg gcccatgagc tgcgagaagg cgttcatgcc ggcctcgttc 8880 

cagacgcggc tgtagaccac ggctccgtcg gggtcgcgcg cgcgcatgac cacctgggcg 8940 

aggttgagct cgacgtggcg cgtgaagacc gcgtagttgc agaggcgctg gtagaggtag 9000 

ttgagcgtgg tggcgatgtg ctcggtgacg aagaagtaca tgatccagcg gcggagcggc 9060 

atctcgctga cgtcgcccag ggcttccaag cgctccatgg cctcgtagaa gtccacggcg 9120 

aagttgaaaa actgggagtt gcgcgccgag acggtcaact cctcctccag aagacggatg 9180 

agctcagcga tggtggcgcg cacctcgcgc tcgaaggccc cggggggctc ctcttcttcc 9240 

atctcttcct cctccactaa catctcttct acttcctcct caggaggcgg cggcggggga 9300 

ggggccctgc gtcgccggcg gcgcacgggc agacggtcga tgaagcgctc gatggtctcc 9360 

ccgcgccggc gacgcatggt ctcggtgacg gcgcgcccgt cctcgcgggg ccgcagcgtg 9420 

aagacgccgc cgcgcatctc caggtggccg ccgggggggt ctccgttggg cagggagagg 9480 

gcgctgacga tgcatcttat caattggccc gtagggactc cgcgcaagga cctgagcgtc 9540 

tcgagatcca cgggatccga aaaccgctga acgaaggctt cgagccagtc gcagtcgcaa 9600 

ggtaggctga gcccggtttc ttgttcttcg gggatttcgg gaggcgggcg ggcgatgctg 9660 

ctggtgatga agttgaagta ggcggtcctg agacggcgga tggtggcgag gagcaccagg 9720 

tccttgggcc cggcttgctg gatgcgcaga cggtcggcca tgccccaggc gtggtcctga 9780 

cacctggcga ggtccttgta gtagtcctgc atgagccgct ccacgggcac ctcctcctcg 9840 

cccgcgcggc cgtgcatgcg cgtgagcccg aacccgcgct ggggctggac gagcgccagg 9900 

tcggcgacga cgcgctcggc gaggatggcc tgctgtatct gggtgagggt ggtctggaag 9960 

tcgtcgaagt cgacgaagcg gtggtaggct ccggtgttga tggtatagga gcagttggcc 10020 

atgacggacc agttgacggt ctggtggccg ggtcgcacga gctcgtggta cttgaggcgc 10080 
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cttctcggtg 


ctgagcctgg 


11640 


gcaagtacta cgctaggaag atctacaaga ccccgtacgt gcccatagac aaggaggtga 11700

agatcgacgg gttttacatg cgcatgaccc tgaaagtgct gaccctgagc gacgatctgg 11760
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gggtgtaccg 


caacgacagg atgcaccgcg cggtgagcgc cagccgccgg ccrccracrctrya 11820

gcgaccagga gctgatgcac agcctgcagc gggccctgac z}z?z)z?^^z?z)z3 accgaggggg 11880

agagctactt tgacatgggc gcggacctgc gctgacaacc cagccgccgg creefc t* crna a rr 11940

ctgccggcgg ttccccctac gtggaggagg tggacgatga acf acrcfacfcracr rfrfpffarfha rr 12000


tggaagactg atggcgcgac cgtatttttg ctagatgcag caacaaccac 12060

tcccgcgatg cgggcggcgc tgcagagcca gccgtcccrcrc attaactcct cggacgattg 12120

gacccaggcc atgcaacgca tcatggcgct gacgacccgc aatcccgaag cctttagaca 12180

gcagcctcag gccaaccggc tctcggccat cctggaggcc gtggtgccct cgcgctcgaa 12240

ccccacgcac gagaaggtgc tggccatcgt gaacgcgctg gtggagaaca aggccatccg 12300

cggcgacgag gccgggctgg tgtacaacgc gctgctggag ca cat act ccc* rr "h ;^ pap rarr 12360

caccaacgtg cagacgaacc tggaccgcat ggtgaccgac crfccrccfccracfcf 33 3 ^-^y CL 12420

gcgcgagcgg ttccaccgcg agtcgaacct gggctccatg gtggcgctga 3 r , rrr , ri~+~ rrt" 12480

gagcacgcag cccgccaacg tgccccgagg ccaggaggac tacaccaact tralraarrrr 12540

gctgcggctg atggtggccg aggtgcccca aagcaacrcrta taccagtcgg crcr cccrcracta 12600

cttcttccag accagtcgcc agggcttgca gaccgtgaac ctgagccagg ctttcaagaa 12660


cttgcaggga ctgtggggcg tgcaggcccc crcrt cq qggac c cr c cr c q a c cr cr tcffcccracrccfc 12720

gctgacgccg aactcgcgcc tgctgctgct gctggtaaca cccttcacaa 12780

cgtgagccgc gactcgtacc tgggctacct gcttaacctg taccgcgagg u^y y y uci 12840

ggcgcacgtg gacgagcaga cctaccagga gatcacccac gtgagccgcg cgctgggcca 12900

ggaggacccg ggcaacctgg aggccaccct gaacttcctg ctgaccaacc ggtcgcagaa 12960

gatcccgccc cagtacgcgc tgagcaccga ggaggagcgc atcctgcgct acgtgcagca 13020

gagcgtgggg ctgttcctga tgcaggaggg ggccacgccc agcgccgcgc t ccra ca tcra c 13080

cgcgcgcaac atggagccca gcatgtacgc tcgcaaccgc ccgttcatca ataagctgat 13140

ggactacttg catcgggcgg ccgccatgaa ctcggactac tttaccaacg ccatcttgaa 13200


cccgcactgg ctcccgccgc ccgggttcta cacqqcfcaaa tacgacatgc ccgaccccaa 13260

cgacggguuc ctgtgggacg acgtggacag cagcgtgttc tcgccgcgcc ccgccaccac 13320

cgtgtggaag aaagagggcg gggaccggcg gccgtcctcg gcgctgtccg gtcgcgcggg 13380

tgctgccgcg gcggtgcctg aggccgccag ccccttcccg agcctgccct tttcgctgaa 13440

cagcgtgcgc agcagcgagc tgggtcggct gacgcggccg cgcctgctgg gcgaggagga 13500
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gtacctgaac gactccttgt tgaggcccga gcgcgagaag aacttcccca ataacgggat 13560 

agagagcctg gtggacaaga tgagccgctg gaagacgtac gcgcacgagc acagggacga 13620

gccccgagct agcagcagcg caggcacccg tagacgccag cgacacgaca ggcagcgggg 13680

tctggtgtgg gacgatgagg attccgccga cgacagcagc gtgttggact tgggtgggag 13740

tggtggtggt aacccgttcg ctcacttgcg cccccgtatc gggcgcctga tgtaagaatc 13800 

tgaaaaaata aaaaacggta ctcaccaagg ccatggcgac cagcgtgcgt tcttctctgt 13860 

tgtttgtagt agt atg atg agg cgc gtg tac ccg gag ggt cct cct ccc 13909 
Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro 
1 5 10

tcg tac gag agc gtg atg cag cag gcg gtg gcg gcg gcg atg cag ccc 13957
Ser Tyr Glu Ser Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro
15 20 25

ccg ctg gag gcg cct tac gtg ccc ccg cgg tac ctg gcg cct acg gag 14005
Pro Leu Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu
30 35 40

ggg cgg aac agc att cgt tac tcg gag ctg gca ccc ttg tac gat acc 14053
Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr
45 50 55 60

acc cgg ttg tac ctg gtg gac aac aag tcg gcg gac atc gcc tcg ctg 14101
Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu
65 70 75

aac tac cag aac gac cac agc aac ttc ctg acc acc gtg gtg cag aac 14149
Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn
80 85 90

aac gat ttc acc ccc acg gag gcc agc acc cag acc atc aac ttt gac 14197
Asn Asp Phe Thr Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp
95 100 105

gag cgc tcg cgg tgg ggc ggc cag ctg aaa acc atc atg cac acc aac 14245
Glu Arg Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn
110 115 120

atg ccc aac gtg aac gag ttc atg tac agc aac aag ttc aag gcg cgg 14293
Met Pro Asn Val Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg
125 130 135 140

gtg atg gtc tcg cgc aag acc ccc aat ggg gtc gcg gtg gat gag aat 14341
Val Met Val Ser Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn
145 150 155

tat gat ggt agt cag gac gag ctg act tac gag tgg gtg gag ttt gag 14389
Tyr Asp Gly Ser Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu
160 165 170
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ctg ccc gag ggc aac ttc teg gtg acc atg acc ate gat ctg atg aac 14437 
Leu Pro Glu Gly Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn
175 180 185

aac gcc atc atc gac aac tac ttg gcg gtg ggg cgt cag aac ggg gtg 14485
Asn Ala Ile Ile Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val
190 195 200

ctg gag agc gac atc ggc gtg aag ttc gac acg cgc aac ttc cgg ctg 14533
Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu
205 210 215 220

ggc tgg gac ccc gtg acc gag ctg gtg atg ccg ggc gtg tac acc aac 14581
Gly Trp Asp Pro Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn
225 230 235

gag gcc ttc cac ccc gac atc gtc ctg ctg ccc ggc tgc ggc gtg gac 14629
Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp
240 245 250

ttc acc gag agc cgc ctc agc aac ctg ctg ggc atc cgc aag cgg cag 14677
Phe Thr Glu Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln
255 260 265

ccc ttc cag gag ggc ttc cag atc ctg tac gag gac ctg gag ggg ggc 14725
Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly
270 275 280

aac atc ccc gcg ctc ttg gat gtc gaa gcc tat gag aaa agc aag gag 14773
Asn Ile Pro Ala Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu
285 290 295 300

gag gcc gcc gca gcg gcg acc gca gcc gtg gcc acc gcc tct acc gag 14821
Glu Ala Ala Ala Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu
305 310 315

gtg cgg ggc gat aat ttt gct agc gcc gcg gca gtg gcc gag gcg gct 14869
Val Arg Gly Asp Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala
320 325 330

gaa acc gaa agt aag ata gtc atc cag ccg gtg gag aag gac agc aag 14917
Glu Thr Glu Ser Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys
335 340 345

gac agg agc tac aac gtg ctc gcg gac aag aaa aac acc gcc tac cgc 14965
Asp Arg Ser Tyr Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg
350 355 360

agc tgg tac ctg gcc tac aac tac ggc gac ccc gag aag ggc gtg cgc 15013
Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg
365 370 375 380

tcc tgg acg ctg ctc acc acc tcg gac gtc acc tgc ggc gtg gag caa 15061
Ser Trp Thr Leu Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln
385 390 395
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gtc tac tgg teg ctg ccc gac atg atg caa gac ccg gtc acc ttc cgc 15109 
Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg
400 405 410

tcc acg cgt caa gtt agc aac tac ccg gtg gtg ggc gcc gag ctc ctg 15157
Ser Thr Arg Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu
415 420 425

ccc gtc tac tcc aag agc ttc ttc aac gag cag gcc gtc tac tcg cag 15205
Pro Val Tyr Ser Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln
430 435 440

cag ctg cgc gcc ttc acc tcg ctc acg cac gtc ttc aac cgc ttc ccc 15253
Gln Leu Arg Ala Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro
445 450 455 460

gag aac cag atc ctc gtc cgc ccg ccc gcg ccc acc att acc acc gtc 15301
Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val
465 470 475

agt gaa aac gtt cct gct ctc aca gat cac ggg acc ctg ccg ctg cgc 15349
Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg
480 485 490

agc agt atc cgg gga gtc cag cgc gtg acc gtc act gac gcc aga cgc 15397
Ser Ser Ile Arg Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg
495 500 505

cgc acc tgc ccc tac gtc tac aag gcc ctg ggc gta gtc gcg ccg cgc 15445
Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg
510 515 520

gtc ctc tcg agc cgc acc ttc taa aaaatgtcca ttctcatctc gcccagtaat 15499
Val Leu Ser Ser Arg Thr Phe
525 530



aacaccggtt ggggcctgcg cgcgcccagc aagatgtacg gaggcgctcg ccaacgctcc 15559

acgcaacacc ccgtgcgcgt gcgcgggcac ttccgcgctc cctggggcgc cctcaagggc 15619

cgcgtgcgct cgcgcaccac cgtcgacgac gtgatcgacc aggtggtggc cgacgcgcgc 15679

aactacacgc ccgccgccgc gcccgcctcc accgtggacg ccgtcatcga cagcgtggtg 15739

gccgatgcgc gccggtacgc ccgcgccaag agccggcggc ggcgcatcgc ccggcggcac 15799

cggagcaccc ccgccatgcg cgcggcgcga gccttgctgc gcagggccag gcgcacggga 15859

cgcagggcca tgctcagggc ggccagacgc gcggcctccg gcagcagcag cgccggcagg 15919

acccgcagac gcgcggccac ggcggcggcg gcggccatcg ccagcatgtc ccgcccgcgg 15979

cgcggcaacg tgtactgggt gcgcgacgcc gccaccggtg tgcgcgtgcc cgtgcgcacc 16039

cgcccccctc gcacttgaag atgctgactt cgcgatgttg atgtgtccca gcggcgagga 16099

ggatgtccaa gcgcaaatac aaggaagaga tgctccaggt catcgcgcct gagatctacg 16159
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gccccgcggt 


gaaggaggaa agaaagcccc gcaaactgaa gcgggtcaaa aaggacaaaa 16219

aggaggagga agatgtggac ggactggtgg agtttgtgcg cgagttcgcc ccccggcggc 16279

gcgtgcagtg gcgcgggcgg aaagtgaaac cggtgctgcg gcccggcacc acggtggtct 16339

tcacgcccgg cgagcgttcc ggctccgcct ccaagcgctc ctacgacgag gtgtacgggg 16399

acgaggacat cctcgagcag gcggtcgagc gtctgggcga gtttgcttac ggcaagcgca 16459

gccgccccgc gcccttgaaa gaggaggcgg tgtccatccc gctggaccac ggcaacccca 16519

cgccgagcct gaagccggtg accctgcagc aggtgctgcc gagcgcggcg ccgcgccggg 16579

gcttcaagcg cgagggcggc gaggatctgt acccgaccat gcagctgatg gtgcccaagc 16639

gccagaagct ggaggacgtg ctggagcaca tgaaggtgga ccccgaggtg cagcccgagg 16699

tcaaggtgcg gcccatcaag caggtggccc cgggcctggg cgtgcagacc gtggacatca 16759

agatccccac ggagcccatg gaaacgcaga ccgagcccgt gaagcccagc accagcacca 16819

tggaggtgca gacggatccc tggatgccgg cgccggcttc caccactcgc cgaagacgca 16879

agtacggcgc ggccagcctg ctgatgccca actacgcgct gcatccttcc atcatcccca 16939

cgccgggcta ccgcggcacg cgcttctacc gcggctacac cagcagccgc cgcaagacca 16999

ccacccgccg ccgccgtcgt cgcacccgcc gcagcagcac cgcgacttcc gccgccgccc 17059

tggtgcggag agtgtaccgc agcgggcgcg agcctctgac cctgccgcgc gcgcgctacc 17119

acccgagcat cgccatttaa ctctgccgtc gcctcctact tgcagatatg gccctcacat 17179

gccgcctccg cgtccccatt acgggctacc gaggaagaaa gccgcgccgt agaaggctga 17239

cggggaacgg gctgcgtcgc catcaccacc ggcggcggcg cgccatcagc aagcggttgg 17299

ggggaggctt cctgcccgcg ctgatcccca tcatcgccgc ggcgatcggg gcgatccccg 17359

gcatagcttc cgtggcggtg caggcctctc agcgccactg agacacagct tggaaaattt 17419

gtaataaaaa aatggactga cgctcctggt cctgtgatgt gtgtttttag atggaagaca 17479

tcaatttttc gtccctggca ccgcgacacg gcacgcggcc gtttatgggc acctggagcg 17539

acatcggcaa cagccaactg aacgggggcg ccttcaattg gagcagtctc tggagcgggc 17599

ttaagaattt cgggtccacg ctcaaaacct atggcaacaa ggcgtggaac agcagcacag 17659

ggcaggcgct gagggaaaag ctgaaagagc agaacttcca gcagaaggtg gtcgatggcc 17719

tggcctcggg catcaacggg gtggtggacc tggccaacca ggccgtgcag aaacagatca 17779

acagccgcct ggacgcggtc ccgcccgcgg ggtccgtgga gatgccccag gtggaggagg 17839
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cgccggtggc 


cgtcgcgtcg cgcgcccccc gaggccgccc ccaggcgaac tggcagagca 18139

ctctgaacag catcgtgggt ctgggagtgc agagtgtgaa gcgccgccgc tgctattaaa 18199

agacactgta gcgcttaact tgcttgtctg tgtgtatatg tatgtccgcc gaccagaagg 18259

aggaagaggc gcgtcgccga gttgcaag atg gcc acc cca tcg atg ctg ccc 18311
Met Ala Thr Pro Ser Met Leu Pro
535

cag tgg gcg tac atg cac atc gcc gga cag gac gct tcg gag tac ctg 18359
Gln Trp Ala Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu
540 545 550 555

agt ccg ggt ctg gtg cag ttc gcc cgc gcc aca gac acc tac ttc agt 18407
Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser
560 565 570

ctg ggg aac aag ttt agg aac ccc acg gtg gcg ccc acg cac gat gtg 18455
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val
575 580 585

acc acc gac cgc agc cag cgg ctg acg ctg cgc ttc gtg ccc gtg gac 18503
Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp
590 595 600

cgc gag gac aac acc tac tcg tac aaa gtg cgc tac acg ctg gcc gtg 18551
Arg Glu Asp Asn Thr Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val
605 610 615

ggc gac aac cgc gtg ctg gac atg gcc agc acc tac ttt gac atc cgc 18599
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg
620 625 630 635

ggc gtg ctg gat cgg ggg ccc agc ttc aaa ccc tac tcc ggc acc gcc 18647
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala
640 645 650

tac aac agc ctg gct ccc aag gga gcg ccc aac act tgc cag tgg aca 18695
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr
655 660 665

tat aaa gct ggt gat act gat aca gaa aaa acc tat aca tat gga aat 18743
Tyr Lys Ala Gly Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn
670 675 680

gca cct gtg caa ggc att agc att aca aag gat ggt att caa ctt gga 18791
Ala Pro Val Gln Gly Ile Ser Ile Thr Lys Asp Gly Ile Gln Leu Gly
685 690 695
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act gac age gat ggt cag gca ate tat gca gac gaa act tat caa cca 18839 

Thr Asp Ser Asp Gly Gln Ala Ile Tyr Ala Asp Glu Thr Tyr Gln Pro
700 705 710 715

gag cct caa gtg ggt gat gct gaa tgg cat gac atc act ggt act gat 18887
Glu Pro Gln Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp
720 725 730

gaa aaa tat gga ggc aga gct ctt aag cct gac acc aaa atg aag cct 18935
Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro
735 740 745

tgc tat ggt tct ttt gcc aag cct acc aat aaa gaa gga ggc cag gca 18983
Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala
750 755 760

aat gtg aaa acc gaa aca ggc ggt acc aaa gaa tat gac att gac atg 19031
Asn Val Lys Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met
765 770 775

gca ttc ttc gat aat cga agt gca gct gcc gcc ggc cta gcc cca gaa 19079
Ala Phe Phe Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu
780 785 790 795

att gtt ttg tat act gag aat gtg gat ctg gaa act cca gat acc cat 19127
Ile Val Leu Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His
800 805 810

att gta tac aag gca ggt aca gat gac agt agc tct tct atc aat ttg 19175
Ile Val Tyr Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu
815 820 825

ggt cag cag tcc atg ccc aac aga ccc aac tac att ggc ttc aga gac 19223
Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp
830 835 840

aac ttt atc ggt ctg atg tac tac aac agc act ggc aat atg ggt gta 19271
Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val
845 850 855

ctg gct gga cag gcc tcc cag ctg aat gct gtg gtg gac ttg cag gac 19319
Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp
860 865 870 875

aga aac acc gaa ctg tcc tac cag ctc ttg ctt gac tct ctg ggt gac 19367
Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp
880 885 890

aga acc agg tat ttc agt atg tgg aat cag gcg gtg gac agt tat gac 19415
Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp
895 900 905

ccc gat gtg cgc att att gaa aat cac ggt gtg gag gat gaa ctt cct 19463
Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro
910 915 920
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aac tat tgc ttc ccc ctg gat get gtg ggt aga act gat act tac cag 19511 
Asn Tyr Cys Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln
925 930 935

gga att aag gcc aat ggt gat aat caa acc acc tgg acc aaa gat gat 19559
Gly Ile Lys Ala Asn Gly Asp Asn Gln Thr Thr Trp Thr Lys Asp Asp
940 945 950 955

act gtt aat gat gct aat gaa ttg ggc aag ggc aat cct ttc gcc atg 19607
Thr Val Asn Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe Ala Met
960 965 970

gag atc aac atc cag gcc aac ctg tgg cgg aac ttc ctc tac gcg aac 19655
Glu Ile Asn Ile Gln Ala Asn Leu Trp Arg Asn Phe Leu Tyr Ala Asn
975 980 985

gtg gcg ctg tac ctg ccc gac tcc tac aag tac acg ccg gcc aac atc 19703
Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys Tyr Thr Pro Ala Asn Ile
990 995 1000

acg ctg ccc acc aac acc aac acc tac gat tac atg aac ggc cgc 19748
Thr Leu Pro Thr Asn Thr Asn Thr Tyr Asp Tyr Met Asn Gly Arg
1005 1010 1015

gtg gtg gcg ccc tcg ctg gtg gac gcc tac atc aac atc ggg gcg 19793
Val Val Ala Pro Ser Leu Val Asp Ala Tyr Ile Asn Ile Gly Ala
1020 1025 1030

cgc tgg tcg ctg gac ccc atg gac aac gtc aac ccc ttc aac cac 19838
Arg Trp Ser Leu Asp Pro Met Asp Asn Val Asn Pro Phe Asn His
1035 1040 1045

cac cgc aac gcg ggc ctg cga tac cgc tcc atg ctc ctg ggc aac 19883
His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu Gly Asn
1050 1055 1060

ggg cgc tac gtg ccc ttc cac atc cag gtg ccc caa aag ttt ttc 19928
Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys Phe Phe
1065 1070 1075

gcc atc aag agc ctc ctg ctc ctg ccc ggg tcc tac acc tac gag 19973
Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr Glu
1080 1085 1090

tgg aac ttc cgc aag gac gtc aac atg atc ctg cag agc tcc ctc 20018
Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu
1095 1100 1105

ggc aac gac ctg cgc acg gac ggg gcc tcc atc gcc ttc acc agc 20063
Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser
1110 1115 1120

atc aac ctc tac gcc acc ttc ttc ccc atg gcg cac aac acc gcc 20108
Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala
1125 1130 1135
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tec acg etc gag gec atg 

ctg cgc aac gac acc aac gac cag tcc 20153
Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser
1140 1145 1150

ttc aac gac tac ctc tcg gcg gcc aac atg ctc tac ccc atc ccg 20198
Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro
1155 1160 1165

gcc aac gcc acc aac gtg ccc atc tcc atc ccc tcg cgc aac tgg 20243
Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp
1170 1175 1180

gcc gcc ttc cgc ggc tgg tcc ttc acg cgc ctc aag acc cgc gag 20288
Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu
1185 1190 1195

acg ccc tcg ctc ggc tcc ggg ttc gac ccc tac ttc gtc tac tcg 20333
Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser
1200 1205 1210

ggc tcc atc ccc tac ctc gac ggc acc ttc tac ctc aac cac acc 20378
Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr
1215 1220 1225

ttc aag aag gtc tcc atc acc ttc gac tcc tcc gtc agc tgg ccc 20423
Phe Lys Lys Val Ser Ile Thr Phe Asp Ser Ser Val Ser Trp Pro
1230 1235 1240

ggc aac gac cgc ctc ctg acg ccc aac gag ttc gaa atc aag cgc 20468
Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg
1245 1250 1255

acc gtc gac gga gag ggg tac aac gtg gcc cag tgc aac atg acc 20513
Thr Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Cys Asn Met Thr
1260 1265 1270

aag gac tgg ttc ctg gtc cag atg ctg gcc cac tac aac atc ggc 20558
Lys Asp Trp Phe Leu Val Gln Met Leu Ala His Tyr Asn Ile Gly
1275 1280 1285

tac cag ggc ttc tac gtg ccc gag ggc tac aag gac cgc atg tac 20603
Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp Arg Met Tyr
1290 1295 1300

tcc ttc ttc cgc aac ttc cag ccc atg agc cgc cag gtc gtg gac 20648
Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val Val Asp
1305 1310 1315

gag gtc aac tac aag gac tac cag gcc gtc acc ctg gcc tac cag 20693
Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala Tyr Gln
1320 1325 1330

cac aac aac tcg ggc ttc gtc ggc tac ctc gcg ccc acc atg cgc 20738
His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met Arg
1335 1340 1345
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cag ggc cag ccc tac ccc gcc aac tac ccc tac ccg etc ate ggc 20783 

Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly
1350 1355 1360

aag agc gcc gtc gcc agc gtc acc cag aaa aag ttc ctc tgc gac 20828
Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp
1365 1370 1375

cgg gtc atg tgg cgc atc ccc ttc tcc agc aac ttc atg tcc atg 20873
Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met
1380 1385 1390

ggc gcg ctc acc gac ctc ggc cag aac atg ctc tac gcc aac tcc 20918
Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser
1395 1400 1405

gcc cac gcg cta gac atg aat ttc gaa gtc gac ccc atg gat gag 20963
Ala His Ala Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu
1410 1415 1420

tcc acc ctt ctc tat gtt gtc ttc gaa gtc ttc gac gtc gtc cga 21008
Ser Thr Leu Leu Tyr Val Val Phe Glu Val Phe Asp Val Val Arg
1425 1430 1435

gtg cac cag ccc cac cgc ggc gtc atc gag gcc gtc tac ctg cgc 21053
Val His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg
1440 1445 1450

acg ccc ttc tcg gcc ggc aac gcc acc acc taa gcctcttgct 21096
Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1455 1460



tcttgcaaga tgacggcctg cgcgggctcc ggcgagcagg agctcagggc catcctccgc 21156

gacctgggct gcgggccctg cttcctgggc accttcgaca agcgcttccc gggattcatg 21216

gccccgcaca agctggcctg cgccatcgtc aacacggccg gccgcgagac cgggggcgag 21276

cactggctgg ccttcgcctg gaacccgcgc tcccacacct gctacctctt cgaccccttc 21336

gggttctcgg acgagcgcct caagcagatc taccagttcg agtacgaggg cctgctgcgt 21396

cgcagcgccc tggccaccga ggaccgctgc gtcaccctgg aaaagtccac ccagaccgtg 21456

cagggtccgc gctcggccgc ctgcgggctc ttctgctgca tgttcctgca cgccttcgtg 21516

cactggcccg accgccccat ggacaagaac cccaccatga acttgctgac gggggtgccc 21576

aacggcatgc tccagtcgcc ccaggtggaa cccaccctgc gccgcaacca ggaggcgctc 21636

taccgcttcc tcaacgccca ctccgcctac tttcgctccc accgcgcgcg catcgagaag 21696

gccaccgcct tcgaccgcat gaatcaagac atgtaatccg gtgtgtgtat gtgaatgctt 21756

tattcatcat aataaacagc acatgtttat gccaccttct ctgaggctct gactttattt 21816

agaaatcgaa ggggttctgc cggctctcgg catggcccgc gggcagggat acgttgcgga 21876
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actggtactt gggcagccac ttgaactcgg ggatcagcag cttcggcacg gggaggtcgg 21936 

ggaacgagtc gctccacagc ttgcgcgtga gttgcagggc gcccagcagg tcgggcgcgg 21996 

agatcttgaa atcgcagttg ggacccgcgt tctgcgcgcg agagttacgg tacacggggt 22056 

tgcagcactg gaacaccatc agggccgggt gcttcacgct cgccagcacc gtcgcgtcgg 22116 

tgatgccctc cacgtccaga tcctcggcgt tggccatccc gaagggggtc atcttgcagg 22176 

tctgccgccc catgctgggc acgcagccgg gcttgtggtt gcaatcgcag tgcaggggga 22236 

tcagcatcat ctgggcctgc tcggagctca tgcccgggta catggccttc atgaaagcct 22296 

ccagctggcg gaaggcctgc tgcgccttgc cgccctcggt gaagaagacc ccgcaggact 22356 

tgctagagaa ctggttggtg gcgcagccag cgtcgtgcac gcagcagcgc gcgtcgttgt 22416 

tggccagctg caccacgctg cgcccccagc ggttctgggt gatcttggcc cggtcggggt 22476 

tctccttcag cgcgcgctgc ccgttctcgc tcgccacatc catctcgatc gtgtgctcct 22536 

tctggatcat cacggtcccg tgcaggcacc gcagcttgcc ctcggcctcg gtgcacccgt 22596 

gcagccacag cgcgcagccg gtgctctccc agttcttgtg ggcgatctgg gagtgcgagt 22656 

gcacgaagcc ctgcaggaag cggcccatca tcgtggtcag ggtcttgttg ctggtgaagg 22716 

tcagcggaat gccgcggtgc tcctcgttca catacaggtg gcagatacgg cggtacacct 22776 

cgccctgctc gggcatcagc tggaaggcgg acttcaggtc gctctccacg cggtaccggt 22836 

ccatcagcag cgtcatcact tccatgccct tctcccaggc cgaaacgatc ggcaggctca 22896 

gggggttctt caccgttgtc atcttagtcg ccgccgccga agtcaggggg tcgttctcgt 22956 

ccagggtctc aaacactcgc ttgccgtcct tctcggtgat gcgcacgggg ggaaagctga 23016 

agcccacggc cgccagctcc tcctcggcct gcctttcgtc ctcgctgtcc tggctgatgt 23076 

cttgcaaagg cacatgcttg gtcttgcggg gtttcttttt gggcggcaga ggcggcggcg 23136 

gagacgtgct gggcgagcgc gagttctcgc tcaccacgac tatttcttct ccttggccgt 23196 

cgtccgagac cacgcggcgg taggcatgcc tcttctgggg cagaggcgga ggcgacgggc 23256 

tctcgcggtt cggcgggcgg ctggcagagc cccttccgcg ttcgggggtg cgctcctggc 23316 

ggcgctgctc tgactgactt cctccgcggc cggccattgt gttctcctag ggagcaagca 23376 

tggagactca gccatcgtcg ccaacatcgc catctgcccc cgccgccgcc gacgagaacc 23436 

agcagcagca gaatgaaagc ttaaccgccc cgccgcccag ccccacctcc gacgccgcag 23496 

ccccagacat gcaagagatg gaggaatcca tcgagattga cctgggctac gtgacgcccg 23556 
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agatcaggtt 
ctacgaggac caatcccagc cgcccaaggc cgagctgtcg gcctgcgtca 25356

tcacccaggg ggccatcctg gcccaattgc aagccatcca gaaatcccgc caagaatttc 25416

tgctgaaaaa gggccacggg gtctacttgg acccccagac cggagaggag ctcaacccca 25476

gcttccccca ggatgccccg aggaagcagc aagaagctga aagtggagct gccgccgccg 25536

ccggaggatt tggaggaaga ctgggagagc agtcaggcag aggaggagga gatggaagac 25596

tgggacagca ctcaggcaga ggaggacagc ctgcaagaca gtctggagga ggaagacgag 25656

gtggaggagg cagaggaaga agcagccgcc gccagaccgt cgtcctcggc ggaggaggag 25716

aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcgg ccgggcccac 25776

agtagatggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25836

cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg cttgcaagcc 25896

tgcgggggca acatctcctt cacccggcgc tacctgctct tccaccgcgg ggtgaacttc 25956

ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctactactg tttccaagaa 26016

gaggcagaaa cccagcagca gcagcagcag cagaaaacca gcggcagcag ctagaaaatc 26076

cacagcggcg gcaggtggac tgaggatcgc ggcgaacgag ccggcgcaga cccgggagct 26136

gaggaaccgg atctttccca ccctctatgc catcttccag cagagtcggg ggcaagagca 26196

ggaactgaaa gtcaagaacc gttctctgcg ctcgctcacc cgcagttgtc tgtatcacaa 26256

gagcgaagac caacttcagc gcactctcga ggacgccgag gctctcttca acaagtactg 26316

cgcgctcact cttaaagagt agcccgcgcc cgcccacaca cggaaaaagg cgggaattac 26376

gtcaccacct gcgcccttcg cccgaccatc atcatgagca aagagattcc cacgccttac 26436

atgtggagct accagcccca gatgggcctg gccgccggcg ccgcccagga ctactccacc 26496

cgcatgaact ggctcagtgc cgggcccgcg atgatctcac gggtgaatga catccgcgcc 26556

caccgaaacc agatactcct agaacagtca gcgatcaccg ccacgccccg ccatcacctt 26616

aatccgcgta attggcccgc cgccctggtg taccaggaaa ttccccagcc cacgaccgta 26676

ctacttccgc gagacgccca ggccgaagtc cagctgacta actcaggtgt ccagctggcc 26736

ggcggcgccg ccctgtgtcg tcaccgcccc gctcagggta taaagcggct ggtgatccga 26796

ggcagaggca cacagctcaa cgacgaggtg gtgagctctt cgctgggtct gcgacctgac 26856

ggagtcttcc aactcgccgg atcggggaga tcttccttca cgcctcgtca ggccgtcctg 26916

actttggaga gttcgtcctc gcagccccgc tcgggtggca tcggcactct ccagttcgtg 26976
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gaggagttca 


ctccctcggt ctacttcaac cccttctccg gctcccccgg ccactacccg 27036

gacgagttca tcccgaactt cgacgccatc agcgagtcgg tggacggcta cgaT-ngaai-.g 27096

tcccatggtg gcgcggctga cctagctcgg cttcgacacc tggaccactg ccgccgcttc 27156

cgctgcttcg ctcgggatct cgccgagttt gcctactttg agctgcccga ggagcacccu 27216

cagggcccgg cccacggagt gcggatcgtc gtcgaagggg g TZC l. CyaCtC CCdCCLCjCLL 27276

cggatcttca gccagcgtcc gatcctggcc gagcgcgagc aaggacagac ccttctgacc 27336

ctgtactgca tctgcaacca ccccggcctg catgaaagtc tttgttgtct gctgtgtact 27396

gagtataata aaagctgaga tcagcgacta ctccggactt ccgtgtgttc ctgctatcaa 27456

ccagtccctg ttcttcaccg ggaacgagac cgagctccag ctccagtgta agccccacaa 27516

gaagtacctc acctggctgt tccagggctc tccgatcgcc gttgtcaacc actgcgacaa 27576

cgacggagtc ctgctgagcg gccctgccaa ccttactttt tccacccgca gaagcaaycL 27636

ccagctcttc caacccttcc tccccgggac ctatcagtgc gtctcgggac cctgccatca 27696


caccttccac ctgatcccga ataccacagc gtcgctcccc gctactaaca accaaactac 27756

ccaccaacgc caccgtcgcg acctttcctc tgggtctaat accactaccg gaggtgagct 27816

ccgaggtcga ccaacctctg ggatttacta cggcccctgg gaggtggtag ggttaatagc 27876

gctaggccta gttgcgggtg ggcttttggc tctctgctac ctatacctcc cttgctgttc 27936

gtacttagtg gtgctgtgtt gctggtttaa gaaatgggga agatcaccct agLgagcugc 27996

ggtgtgctgg tggcggtggt gctttcgatt gtgggactgg gcggcgcggc tigt-agt-gaag 28056

gagaaggccg atccctgctt gcatttcaat cccgacaaat gecagcuyay LULL. L-ay UtU 28116

gatggcaatc ggtgcgcggt gctgatcaag tgcggatggg aatgcgagaa cguyay ad l. c 28176

gagtacaata acaagactcg gaacaatact ctcgcgtccg tgtggcagcc cggggacccc 28236

gagtggtaca ccgtctctgt ccccggtgct gacggctccc cgcgcaccgt gaauaatact 28296

ttcatttttg cgcacatgtg cgacacggtc atgtggatga gcaagcagta cgatatgtgg 28356

ccccccacga aggagaacat cgtggtcttc tccatcgctt acagcgTi.gTi.g cacggcgcua 28416

atcaccgcta tcgtgtgcct gagcattcac atgctcatcg ctattcgccc cagaaataat 28476

gccgaaaaag aaaaacagcc ataacacgtt ttttcacaca cctttttcag accatggcct 28536


ctgttaaatt tttgctttta tttgccagtc tcattgccgt cattcatgga atgagtaatg 28596

agaaaattac tatttacact ggcactaatc acacattgaa aggtccagaa aaagccacag 28656

aagtttcatg gtattgttat tttaatgaat cagatgtatc tactgaactc tgtggaaaca 28716
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ataacaaaaa 


aaatgagagc attactctca tcaagtttca atgtggatct gacttaaccc 28776

taattaacat cactagagac tatgtaggta tgtattatgg aactacagca ggcatttcgg 28836

acatggaatt ttatcaagtt tctgtgtctg aacccaccac gcctagaatg accacaacca 28896

caaaaactac acctgttacc actatacagc tcactaccaa tggctttctt gccatgcttc 28956

aagtggctga aaatagcacc agcattcaac ccaccccacc cagtgaggaa attcccagat 29016

ccatgattgg cattattgtt gctgtagtgg tgtgcatgtt gatcatcgcc ttgtgcatgg 29076

tgtactatgc cttctgctac agaaagcaca gactgaacga caagctggaa cacttactaa 29136

gtgttgaatt ttaatttttt agaaccatga agatcctagg ccttttagtt ttttctatca 29196

ttacctctgc tctatgcaat tctgacaatg aggacgttac tgtcgttgtc ggatcaaatt 29256

atacactaaa aggtccagca aaaggtatgc tttcgtggta ttgttggttc ggaactgacg 29316

agcaacagac agaactttgc aatgctcaaa aaggcaaaac ctcaaattct aaaatctcta 29376

attatcaatg caatggcact gacttagtat tgctcaatgt cacgaaagca tatgctggca 29436

gttacacctg ccctggagat gatgccgaca atatgatttt ttacaaagtg gaagtggttg 29496

atcccactac tccaccgccc accaccacaa ctactcatac cacacacaca gaacaaacac 29556

cagaggcagc agaagcagag ttggccttcc aggttcacgg agattccttt gctgtcaata 29616

cccctacacc cgatcagcgg tgtccggggc tgctcgtcag cggcattgtc ggtgtgcttt 29676

cgggattagc agtcataatc atctgcatgt tcatttttgc ttgctgctat agaaggcttt 29736

accgacaaaa atcagaccca ctgctgaacc tctatgttta attttttcca gagccatgaa 29796

ggcagttagc gctctagttt tttgttcttt gattggcatt gtttttagtg ctgggttttt 29856

gaaaaatctt accatttatg aaggtgagaa tgccactcta gtgggcatca gtggtcaaaa 29916

tgtcagctgg ctaaaatacc atctagatgg gtggaaagac atttgcgatt ggaatgtcac 29976

tgtgtataca tgtaatggag ttaacctcac cattactaat gccacccaag atcagaatgg 30036

taggtttaag ggccagagtt tcactagaaa taatgggtat gaatcccata acatgtttat 30096

ctatgacgtc actgtcatca gaaatgagac tgccaccacc acacagatgc ccactacaca 30156

cagttctacc actactacca tgcaaaccac acagacaacc actacatcaa ctcagcatat 30216

gaccaccact acagcagcaa agccaagtag tgcagcgcct cagccccagg ctttggcttt 30276

gaaagctgca caacctagta caactactag gaccaatgag cagactactg aatttttgtc 30336

cactgtcgag agccacacca cagctacctc cagtgccttc tctagcaccg ccaatctctc 30396
ctcgctttcc


tctacaccaa 


4/" y*^ 1 "\ — f + (~~* /T /^t 

Lcay tcccyc 


LdLLdLLLLL 


dLLLLdy LLL 


1 - i _ p""l — p*p*p , p , a p* 
L LL LLLLLetL 


JU1JU 


tcccctgaag 


ca a a c L.g agg 


a p< o pt i~< /~r a "H* 
dLdy Lgy La L 


yLcict Ly y Lay 


CtLLctLLLLyL 


4- r~> 4- 4- "t - Pf 3 4~ 
l Let l l y l y ct l 


30516 


cgggttggtc


atcctggccg 


tgttgctcta 


^ 4~~ ^ -f* 4** 4** / — i 
LLaCaLLLLL 


Lgccgccgca 


4- 4— <r> ^ r** 

LLCCCddCgC 


0 fj rr 7 p; 


gcaccgcaaa 


ccggcctaca 


agcccatcgt 


Latcgggcag 


1—* f< ^ P* Pf P* 

ccggdgcL-gL 


~\ — 4~ t—* o y^f /~f 4~" /^f j^f "3 

l l Let y y l y y ct 


*3 n fi ^ 

JUG JO 


agggggLCLa 


aggaatCLLc 


LlLLCLClLL 


LdCagLa tgg 


4~ /^f -a 4— 4— a ^» 4— 
LydLLyddLL 


n 4- pf a 4* 4- pni* a 
ct Lycl L L LL Let 


O L L D 


«— * ^ 4-» 4-» y— * 4— 4~~ 
gd CaauLCtL 


gatCaCuatU 


_ 4— 4— = 4— f~i 4— /—i /~t 

LLLdLLLyLL 


LLLLLLctcty L 


pi" pf - PT P* P* 3 P , P* 
LLy Ly LLctLL 


<— « 4— p<pfp»4— p4- prpf 
LLLyLLLLyy 


30756


tggccaacgc 


caguccagac 


■4— 4~ 4~" "i — /->r /^r 

tytatLgggc 


LL L LCgCL L L 


LLdLLj L y L L L 


4~ 4~ 4~ pf p» p* 4~ 4~ p« 
LLLyLLLL Let 


JUOIU 


fccacctgcat 


***t 4™ Hr™ rf^i 4— j-*t 4~* 

CL.gCt.gcUy I. 


agcatagLCL 


4— t*-i 4— 4— ^ 4— 

gCCtyCLtaL 


La LLL LLLLL 


p«a pt4— 4— r" 'a 4~ 4p pf 
Lcty L L Let L L y 


0 u 0 / 0 


actggatcLL 


tgtgcgcatc


gCCLaCCLyC 


f~r t—* *3 1 « < 

yCCdCCdCCC 


LLdy Let LLy L 


yctLLcty Lycty 


O W _7 O U 


fcggcgcggcfc 


gctcaggctc 


/—i 4— /— < 4^* ^nt 4-» — v ^ f** 

CLCLgaLday 


CaLycgggct 


« 4— . 4- -a 4- 4- /-< 
LLgCLctLLLL 


4— c~* pf p 1 pf p* +* 4~ p< 4~ 

L Ly Ly L L LLL 


O L> O 




/— ■* 4— t—t r~> r~* t~* rf /—» /""• 
LLLLLLLyLL 


/—i < /-f 4- p»p*p< 
LLy LLyaLLL 


LLyy LLLLLL 


a P 1 +" P 1 ^ Pf 4~ P< P< P* 
CtL L Let y LLLL 


p*p , pf^ 3 pf ;a pfpf 4* 
LLyctetyctyy l 




ccgcaaatgc 


aaattccaag


aaccctggaa


attcctcaaa 


tgctaccgcc 


aaaaatcaga 


01 1 1 p 
0 ± ± JL O 


catgcttccc 


agctggatca 


tgatcatLgg 


gatcgtgaac


4— 4— « 4— s-~r t~~^r r~i 4— 

dLLCuggcCL 


yCdCCCL Cat 


0 ± ± / D 


ctcctttgtg


— * 4— 4— 4-" — \ v-h /—i /-i 

aLLLdCCCCL 


^ 4— 4— 4— y^r « 4— 4— 

gc l l Lgac L L 


LggLLggaac 


LLyLLciyctyy 


p , pfp*4~p , 4 - a4~/-<4~ 
Ly LLL Let LLL 


O J_ <£. O \J 


cccgccLgaa 


CCLgaCaCaC 


LcALLciLJciy Lei 


clLLLLctyy L ct 


i — i prfpapi"a p* 
LctLyLctLLctL 


pa p p a p a 
LctLLctLLctLa 


O JL ^i. -? L> 


pt p« p«4 - :a rr p» f" 1 a 


Udct LaLa LyL 


L L Ct Let L Lctyct 


LLdLyduyLL 


Pf P" P* ^1 p< f-r <— • 
yctyLLctLcty l 


rr^ r , p i r^3+"pfp*'t~ 
y a 0 0 a lu^l 


31356


ppppfTP'ha'h'H 
LULLy L La U L 


a. y L La. U L L Ua 


s^pfpapprrrr 
Ct L l LctctLLyy 


Lyyctyct l y ct l 


4- rf 3 r , r'p , 3 p« i - pf 

L y Cl L L LCI L L 


y l l a a a a >a 


31416 


acgfccaacga 


ccttctcctg


gacatggacg 


gccgcgcctc 


ggagcagcga


r~> 4— c* pf p« p« p< a r> 
L LLy LLLctctL 


31476


4™ 4~ r~i i — i — * H — 4— ^ 

LLCyCdLtCy 


ccagcagcay 


gagagagccg


LLddggdy LL 


rrr , a rtrra pf pf p^ 

y Ldy y duyy l 


afa rfppa'hpp 
ct Lcty L L ct LLL 


31536


accagtgcaa 


gaaaggcauc 


4— 4— 4— 4— 

l l c Lgcc L»gg 


Lgaaacaggc 


caagaLLULL 


+■ a r< <~f sa /~r <~r 4~ p« 

Let Ly ct y y L Let 


31596


P* P< ^ V*S rf^f ~> 

L.L-L.L.yaL/Uya 


i-t f-% ^> 4— j— • /-f f— < 4- 
ULd LLy LL L L 


LLLLctLyctyL 


T" P 1 P 1 1 - Pf P* 3 Pf P* 3 

LLL Ly Lcty Let 


pfp , pfp*p*^ pf ^a pr 
yLyLLctyetcty 


4- 4- i-» ra p~ • p^ 4~ Pf p* p* 
L LLaLLLyLL 


31656 


4- ^-f c^f 4— p/vpfa pt4p 

uy y L.uy yay l 


LCtaL.LUL.Cl LL 


y LLCtLLCtLLL 


=3 pfp<rs pfi~ P«PfPfPf 
ct y Lcty LLyy y 


Lycl LCtLLCtCiy 


pf^f^"^~^f^ , a , t"^ , ^ , 

Ljy y uv) 1 — • ca. u. ^ u 


31716 


a p 1 4/- pf p" 1 "t - (~«r*»"t~/T 
ClL. L.y v — lul. Ly 


Ly O.LLLLULL 


Pf3 P*"fp PfP'Pf'i - P^P* 

yctLLyLy lll 


Ct LCI 1 — ■ 1 — — • l, y Cl l— 


p* ^ p> pf 3 p* c* c^r c 
cl y ci 1 — - v_« i — ^ — ■ 


"t"Pfr , rrnTT , +"r , r' 

V*J ^ 1 — ■ 


31776 


gcgacctccu 


CCCCatyaaC 


tad LLaCLLL 


LLLclLLLcty L 


y ct d ct L ct ct ct L ct 


LLctLCtLLyaL 


31836


gatgatttaa 


ataaaaaata 


atcatttgat 


ttgaaataaa 


gatacaatca 


tattgatgat 


31896 


ttgagtttta 


aaaaataaag 


aatcacttac 


ttgaaatctg 


ataccaggtc 


tctgtccatg 


31956 


ttttctgcca 


acaccacctc 


actcccctct 


tcccagctct 


ggtactgcag 


accccggcgg 


32016 


gctgcaaact 


tcctccacac 


gctgaagggg 


atgtcaaatt 


cctcctgtcc 


ctcaatcttc 


32076 
attttatctt ctatcag atg tcc aaa aag cgc gtc cgg gtg gat gat gac 32126

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp 
1465 1470 

ttc gac ccc gtc tac ccc tac gat gca gac aac gca ccg acc gtg 32171 

Phe Asp Pro Val Tyr Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val 
1475 1480 1485 

ccc ttc atc aac ccc ccc ttc gtc tct tca gat gga ttc caa gag 32216

Pro Phe Ile Asn Pro Pro Phe Val Ser Ser Asp Gly Phe Gln Glu
1490 1495 1500 

aag ccc ctg ggg gtg ctg tcc ctg cga ctg gct gac ccc gtc acc 32261

Lys Pro Leu Gly Val Leu Ser Leu Arg Leu Ala Asp Pro Val Thr 
1505 1510 1515 

acc aag aac ggg gaa atc acc ctc aag ctg gga gag ggg gtg gac 32306

Thr Lys Asn Gly Glu Ile Thr Leu Lys Leu Gly Glu Gly Val Asp
1520 1525 1530 

ctc gac tcc tcg gga aaa ctc atc tcc aac acg gcc acc aag gcc 32351

Leu Asp Ser Ser Gly Lys Leu Ile Ser Asn Thr Ala Thr Lys Ala
1535 1540 1545 

gcc gcc cct ctc agt ttt tcc aac aac acc att tcc ctt aac atg 32396

Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr Ile Ser Leu Asn Met
1550 1555 1560 

gat acc cct ctt tat acc aaa gat gga aaa tta tcc tta caa gtt 32441

Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu Ser Leu Gln Val
1565 1570 1575 

tct cca ccg tta aac ata tta aaa tca acc att ctg aac aca tta 32486

Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu Asn Thr Leu
1580 1585 1590 

gct gta gct tat gga tca ggt tta gga ctg agt ggt ggc act gct 32531

Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly Thr Ala 
1595 1600 1605 

ctt gca gta cag ttg gcc tct cca ctc act ttt gat gaa aaa gga 32576

Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys Gly
1610 1615 1620 

aat att aaa att aac cta gcc agt ggt cca tta aca gtt gat gca 32621

Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala
1625 1630 1635 

agt cga ctt agt atc aac tgc aaa aga ggg gtc act gtc act acc 32666

Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr
1640 1645 1650 

tca gga gat gca att gaa agc aac ata agc tgg cct aaa ggt ata 32711

Ser Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile
1655 1660 1665 
aga ttt gaa ggt aat ggc ata gct gca aac att ggc aga gga ttg 32756
Arg Phe Glu Gly Asn Gly Ile Ala Ala Asn Ile Gly Arg Gly Leu
1670 1675 1680 

gaa ttt gga acc act agt aca gag act gat gtc aca gat gca tac 32801 
Glu Phe Gly Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr 
1685 1690 1695 

cca att caa gtt aaa ttg ggt act ggc ctt acc ttt gac agt aca 32846 
Pro Ile Gln Val Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr
1700 1705 1710 

ggc gcc att gtt gct tgg aac aaa gag gat gat aaa ctt aca tta 32891
Gly Ala Ile Val Ala Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu
1715 1720 1725 

tgg acc aca gcc gac ccc tcg cca aat tgc aaa ata tac tct gaa 32936
Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser Glu
1730 1735 1740 

aaa gat gcc aaa ctc aca ctt tgc ttg aca aag tgt gga agt caa 32981
Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
1745 1750 1755 

att ctg ggt act gtg act gta ttg gca gtg aat aat gga agt ctc 33026
Ile Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu
1760 1765 1770 

aac cca atc aca aac aca gta agc act gca ctc gtc tcc ctc aag 33071
Asn Pro Ile Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys
1775 1780 1785 

ttt gat gca agt gga gtt ttg cta agc agc tcc aca tta gac aaa 33116
Phe Asp Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys 
1790 1795 1800 

gaa tat tgg aac ttc aga aag gga gat gtt aca cct gct gag ccc 33161
Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro
1805 1810 1815

tat act aat gct ata ggt ttt atg cct aac ata aag gcc tat cct 33206
Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro
1820 1825 1830 

aaa aac aca tct gca gct tca aaa agc cat att gtc agt caa gtt 33251
Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser Gln Val
1835 1840 1845 

tat ctc aat ggg gat gag gcc aaa cca ctg atg ctg att att act 33296
Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu Ile Ile Thr
1850 1855 1860 

ttt aat gaa act gag gat gca act tgc acc tac agt atc act ttt 33341
Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe
1865 1870 1875 
caa tgg aaa tgg gat agt act aag tac aca ggt gaa aca ctt act 33386
Gln Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala
1880 1885 1890 



acc agc tcc ttc acc ttc tcc tac atc gcc caa gaa tga acactgtatc 33435
Thr Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gln Glu
1895 1900 1905 

ccaccctgca tgccaaccct tcccacccca ctctgtctat ggaaaaaact ctgaagcaca 33495
aaataaaata aagttcaagt gttttattga ttcaacagtt ttacaggatt cgagcagtta 33555
tttttcctcc accctcccag gacatggaat acaccaccct ctccccccgc acagccttga 33615
acatctgaat gccattggtg atggacatgc ttttggtctc cacgttccac acagtttcag 33675
agcgagccag tctcgggtcg gtcagggaga tgaaaccctc cgggcactcc cgcatctgca 33735
cctcacagct caacagctga ggattgtcct cggtggtcgg gatcacggtt atctggaaga 33795
agcagaagag cggcggtggg aatcatagtc cgcgaacggg atcggccggt ggtgtcgcat 33855
caggccccgc agcagtcgct gccgccgccg ctccgtcaag ctgctgctca gggggtccgg 33915
gtccagggac tccctcagca tgatgcccac ggccctcagc atcagtcgtc tggtgcggcg 33975
ggcgcagcag cgcatgcgga tctcgctcag gtcgctgcag tacgtgcaac acaggaccac 34035
caggttgttc aacagtccat agttcaacac gctccagccg aaactcatcg cgggaaggat 34095
gctacccacg tggccgtcgt accagatcct caggtaaatc aagtggcgct ccctccagaa 34155
cacgctgccc acgtacatga tctccttggg catgtggcgg ttcaccacct cccggtacca 34215
catcaccctc tggttgaaca tgcagccccg gatgatcctg cggaaccaca gggccagcac 34275
cgccccgccc gccatgcagc gaagagaccc cgggtcccgg caatggcaat ggaggaccca 34335
ccgctcgtac ccgtggatca tctgggagct gaacaagtct atgttggcac agcacaggca 34395
tatgctcatg catctcttca gcactctcag ctcctcgggg gtcaaaacca tatcccaggg 34455
cacggggaac tcttgcagga cagcgaaccc cgcagaacag ggcaatcctc gcacataact 34515
tacattgtgc atggacaggg tatcgcaatc aggcagcacc gggtgatcct ccaccagaga 34575
agcgcgggtc tcggtctcct cacagcgtgg taagggggcc ggccgatacg ggtgatggcg 34635
ggacgcggct gatcgtgttc gcgaccgtgt catgatgcag ttgctttcgg acattttcgt 34695
acttgctgta gcagaacctg gtccgggcgc tgcacaccga tcgccggcgg cggtcccggc 34755
gcttggaacg ctcggtgttg aaattgtaaa acagccactc tctcagaccg tgcagcagat 34815
ctagggcctc aggagtgatg aagatcccat catgcctgat agctctgatc acatcgacca 34875
ccgtggaatg ggccagaccc agccagatga tgcaattttg ttgggtttcg gtgacggcgg 34935
gggagggaag aacaggaaga accatgatta acttttaatc caaacggtct cggagcactt 34995

caaaatgaag gtcgcggaga tggcacctct cgcccccgct gtgttggtgg aaaataacag 35055 

ccaggtcaaa ggtgatacgg ttctcgagat gttccacggt ggcttccagc aaagcctcca 35115 

cgcgcacatc cagaaacaag acaatagcga aagcgggagg gttctctaat tcctcaatca 35175 

tcatgttaca ctcctgcacc atccccagat aattttcatt tttccagcct tgaatgattc 35235 

gaactagttc ctgaggtaaa tccaagccag ccatgataaa gagctcgcgc agagcgccct 35295 

ccaccggcat tcttaagcac accctcataa ttccaagata ttctgctcct ggttcacctg 35355 

cagcagattg acaagcggaa tatcaaaatc tctgccgcga tccctaagct cctccctcag 35415 

caataactgt aagtactctt tcatatcctc tccgaaattt ttagccatag gaccaccagg 35475 

aataagatta gggcaagcca cagtacagat aaaccgaagt cctccccagt gagcattgcc 35535 

aaatgcaaga ctgctataag catgctggct agacccggtg atatcttcca gataactgga 35595 

cagaaaatca cccaggcaat ttttaagaaa atcaacaaaa gaaaaatcct ccaggtgcac 35655 

gtttagagcc tcgggaacaa cgatgaagta aatgcaagcg gtgcgttcca gcatggttag 35715 

ttagctgatc tgtaaaaaac aaaaaataaa acattaaacc atgctagcct ggcgaacagg 35775 

tgggtaaatc gttctctcca gcaccaggca ggccacgggg tctccggcgc gaccctcgta 35835 

aaaattgtcg ctatgattga aaaccatcac agagagacgt tcccggtggc cggcgtgaat 35895 

gattcgacaa gatgaataca cccccggaac attggcgtcc gcgagtgaaa aaaagcgccc 35955 

gaggaagcaa taaggcacta caatgctcag tctcaagtcc agcaaagcga tgccatgcgg 36015 

atgaagcaca aaatcctcag gtgcgtacaa aatgtaatta ctcccctcct gcacaggcag 36075 

cgaagccccc gatccctcca gatacacata caaagcctca gcgtccatag cttaccgagc 36135 

agcagcacac aacaggcgca agagtcagag aaaggctgag ctctaacctg tccacccgct 36195 

ctctgctcaa tatatagccc agatctacac tgacgtaaag gccaaagtct aaaaataccc 36255 

gccaaataat cacacacgcc cagcacacgc ccagaaaccg gtgacacact caaaaaaata 36315 

cgcgcacttc ctcaaacgcc caaactgccg tcatttccgg gttcccacgc tacgtcatcg 36375 

gaattcgact ttcaaattcc gtcgaccgtt aaaaacgtca cccgccccgc ccctaacggt 36435 

cgcccgtctc tcggccaatc accttcctcc ctccccaaat tcaaacagct catttgcata 36495 

ttaacgcgca ccaaaagttt gaggtatatt attgatgatg 36535 
<210> 10

<211> 531 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 

<400> 10 

Met Met Arg Arg Val Tyr Pro Glu Gly Pro Pro Pro Ser Tyr Glu Ser 
1 5 10 15 

Val Met Gln Gln Ala Val Ala Ala Ala Met Gln Pro Pro Leu Glu Ala
20 25 30 

Pro Tyr Val Pro Pro Arg Tyr Leu Ala Pro Thr Glu Gly Arg Asn Ser 
35 40 45 

Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Arg Leu Tyr
50 55 60

Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn
65 70 75 80

Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr
85 90 95 

Pro Thr Glu Ala Ser Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg
100 105 110

Trp Gly Gly Gln Leu Lys Thr Ile Met His Thr Asn Met Pro Asn Val
115 120 125 

Asn Glu Phe Met Tyr Ser Asn Lys Phe Lys Ala Arg Val Met Val Ser 
130 135 140 

Arg Lys Thr Pro Asn Gly Val Ala Val Asp Glu Asn Tyr Asp Gly Ser 
145 150 155 160 

Gln Asp Glu Leu Thr Tyr Glu Trp Val Glu Phe Glu Leu Pro Glu Gly
165 170 175

Asn Phe Ser Val Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Ile
180 185 190

Asp Asn Tyr Leu Ala Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp
195 200 205

Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro
210 215 220

Val Thr Glu Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His 
225 230 235 240 

Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Glu Ser
245 250 255

Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Gln Pro Phe Gln Glu
260 265 270

Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Ala
275 280 285

Leu Leu Asp Val Glu Ala Tyr Glu Lys Ser Lys Glu Glu Ala Ala Ala 
290 295 300 

Ala Ala Thr Ala Ala Val Ala Thr Ala Ser Thr Glu Val Arg Gly Asp 
305 310 315 320

Asn Phe Ala Ser Ala Ala Ala Val Ala Glu Ala Ala Glu Thr Glu Ser 
325 330 335 

Lys Ile Val Ile Gln Pro Val Glu Lys Asp Ser Lys Asp Arg Ser Tyr
340 345 350 

Asn Val Leu Ala Asp Lys Lys Asn Thr Ala Tyr Arg Ser Trp Tyr Leu 
355 360 365 

Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Leu 
370 375 380

Leu Thr Thr Ser Asp Val Thr Cys Gly Val Glu Gln Val Tyr Trp Ser
385 390 395 400

Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Ser Thr Arg Gln
405 410 415 

Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Leu Pro Val Tyr Ser 
420 425 430 

Lys Ser Phe Phe Asn Glu Gln Ala Val Tyr Ser Gln Gln Leu Arg Ala
435 440 445

Phe Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile
450 455 460

Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val
465 470 475 480

Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Arg
485 490 495

Gly Val Gln Arg Val Thr Val Thr Asp Ala Arg Arg Arg Thr Cys Pro
500 505 510 

Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Arg Val Leu Ser Ser 
515 520 525 

Arg Thr Phe 
530 

<210> 11 

<211> 932 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 
<400> 11

Met Ala Thr Pro Ser Met Leu Pro Gln Trp Ala Tyr Met His Ile Ala
1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Asn Thr Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Gly Asp Thr Asp Thr
130 135 140

Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln Gly Ile Ser Ile
145 150 155 160

Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Ser Asp Gly Gln Ala Ile
165 170 175

Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val Gly Asp Ala Glu
180 185 190

Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu
195 200 205

Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro 
210 215 220 

Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr Glu Thr Gly Gly
225 230 235 240

Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp Asn Arg Ser Ala
245 250 255

Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr Thr Glu Asn Val
260 265 270

Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys Ala Gly Thr Asp
275 280 285

Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser Met Pro Asn Arg
290 295 300

Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr
305 310 315 320

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu
325 330 335

Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln
340 345 350

Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp
355 360 365

Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn
370 375 380 

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Ala 
385 390 395 400 

Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala Asn Gly Asp Asn
405 410 415

Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp Ala Asn Glu Leu
420 425 430

Gly Lys Gly Asn Pro Phe Ala Met Glu Ile Asn Ile Gln Ala Asn Leu
435 440 445 

Trp Arg Asn Phe Leu Tyr Ala Asn Val Ala Leu Tyr Leu Pro Asp Ser 
450 455 460 

Tyr Lys Tyr Thr Pro Ala Asn Ile Thr Leu Pro Thr Asn Thr Asn Thr
465 470 475 480 

Tyr Asp Tyr Met Asn Gly Arg Val Val Ala Pro Ser Leu Val Asp Ala 
485 490 495 

Tyr Ile Asn Ile Gly Ala Arg Trp Ser Leu Asp Pro Met Asp Asn Val
500 505 510 

Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met 
515 520 525 

Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln
530 535 540

Lys Phe Phe Ala Ile Lys Ser Leu Leu Leu Leu Pro Gly Ser Tyr Thr
545 550 555 560

Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser
565 570 575

Leu Gly Asn Asp Leu Arg Thr Asp Gly Ala Ser Ile Ala Phe Thr Ser
580 585 590

Ile Asn Leu Tyr Ala Thr Phe Phe Pro Met Ala His Asn Thr Ala Ser
595 600 605 

Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn
610 615 620

Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala
625 630 635 640

Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg
645 650 655

Gly Trp Ser Phe Thr Arg Leu Lys Thr Arg Glu Thr Pro Ser Leu Gly 
660 665 670 

Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu
675 680 685

Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Thr
690 695 700 

Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro 
705 710 715 720

Asn Glu Phe Glu Ile Lys Arg Thr Val Asp Gly Glu Gly Tyr Asn Val
725 730 735 

Ala Gln Cys Asn Met Thr Lys Asp Trp Phe Leu Val Gln Met Leu Ala
740 745 750

His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys
755 760 765

Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln
770 775 780

Val Val Asp Glu Val Asn Tyr Lys Asp Tyr Gln Ala Val Thr Leu Ala
785 790 795 800

Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Leu Ala Pro Thr Met
805 810 815

Arg Gln Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly
820 825 830

Lys Ser Ala Val Ala Ser Val Thr Gln Lys Lys Phe Leu Cys Asp Arg
835 840 845

Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ala
850 855 860

Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala
865 870 875 880

Leu Asp Met Asn Phe Glu Val Asp Pro Met Asp Glu Ser Thr Leu Leu 
885 890 895 
Tyr Val Val Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro His
900 905 910

Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly
915 920 925 

Asn Ala Thr Thr 
930 



<210> 12 

<211> 443 

<212> PRT 

<213> chimpanzee adenovirus serotype Pan7 

<400> 12 

Met Ser Lys Lys Arg Val Arg Val Asp Asp Asp Phe Asp Pro Val Tyr 
1 5 10 15 

Pro Tyr Asp Ala Asp Asn Ala Pro Thr Val Pro Phe Ile Asn Pro Pro
20 25 30

Phe Val Ser Ser Asp Gly Phe Gln Glu Lys Pro Leu Gly Val Leu Ser
35 40 45

Leu Arg Leu Ala Asp Pro Val Thr Thr Lys Asn Gly Glu Ile Thr Leu
50 55 60

Lys Leu Gly Glu Gly Val Asp Leu Asp Ser Ser Gly Lys Leu Ile Ser
65 70 75 80

Asn Thr Ala Thr Lys Ala Ala Ala Pro Leu Ser Phe Ser Asn Asn Thr 
85 90 95 

Ile Ser Leu Asn Met Asp Thr Pro Leu Tyr Thr Lys Asp Gly Lys Leu
100 105 110

Ser Leu Gln Val Ser Pro Pro Leu Asn Ile Leu Lys Ser Thr Ile Leu
115 120 125 

Asn Thr Leu Ala Val Ala Tyr Gly Ser Gly Leu Gly Leu Ser Gly Gly 
130 135 140 

Thr Ala Leu Ala Val Gln Leu Ala Ser Pro Leu Thr Phe Asp Glu Lys
145 150 155 160

Gly Asn Ile Lys Ile Asn Leu Ala Ser Gly Pro Leu Thr Val Asp Ala
165 170 175

Ser Arg Leu Ser Ile Asn Cys Lys Arg Gly Val Thr Val Thr Thr Ser
180 185 190

Gly Asp Ala Ile Glu Ser Asn Ile Ser Trp Pro Lys Gly Ile Arg Phe
195 200 205 
Glu Gly Asn Gly Ile Ala Ala Asn Ile Gly Arg Gly Leu Glu Phe Gly
210 215 220

Thr Thr Ser Thr Glu Thr Asp Val Thr Asp Ala Tyr Pro Ile Gln Val
225 230 235 240

Lys Leu Gly Thr Gly Leu Thr Phe Asp Ser Thr Gly Ala Ile Val Ala
245 250 255

Trp Asn Lys Glu Asp Asp Lys Leu Thr Leu Trp Thr Thr Ala Asp Pro
260 265 270

Ser Pro Asn Cys Lys Ile Tyr Ser Glu Lys Asp Ala Lys Leu Thr Leu
275 280 285

Cys Leu Thr Lys Cys Gly Ser Gln Ile Leu Gly Thr Val Thr Val Leu
290 295 300

Ala Val Asn Asn Gly Ser Leu Asn Pro Ile Thr Asn Thr Val Ser Thr
305 310 315 320

Ala Leu Val Ser Leu Lys Phe Asp Ala Ser Gly Val Leu Leu Ser Ser
325 330 335

Ser Thr Leu Asp Lys Glu Tyr Trp Asn Phe Arg Lys Gly Asp Val Thr
340 345 350

Pro Ala Glu Pro Tyr Thr Asn Ala Ile Gly Phe Met Pro Asn Ile Lys
355 360 365

Ala Tyr Pro Lys Asn Thr Ser Ala Ala Ser Lys Ser His Ile Val Ser
370 375 380

Gln Val Tyr Leu Asn Gly Asp Glu Ala Lys Pro Leu Met Leu Ile Ile
385 390 395 400

Thr Phe Asn Glu Thr Glu Asp Ala Thr Cys Thr Tyr Ser Ile Thr Phe
405 410 415

Gln Trp Lys Trp Asp Ser Thr Lys Tyr Thr Gly Glu Thr Leu Ala Thr
420 425 430

Ser Ser Phe Thr Phe Ser Tyr Ile Ala Gln Glu
435 440



<210> 13 

<211> 338 

<212> PRT 

<213> simian serotype C1

<400> 13 



Ala Pro Lys Gly Ala Pro Asn Thr Ser Gln Trp Leu Asp Lys Gly Val
1 5 10 15

Thr Thr Thr Asp Asn Asn Thr Glu Asn Gly Asp Glu Glu Asp Glu Val
20 25 30

Ala Glu Glu Gly Glu Glu Glu Lys Gln Ala Thr Tyr Thr Phe Gly Asn
35 40 45

Ala Pro Val Lys Ala Glu Ala Glu Ile Thr Lys Glu Gly Leu Pro Ile
50 55 60

Gly Leu Glu Val Pro Ser Glu Gly Asp Pro Lys Pro Ile Tyr Ala Asp
65 70 75 80

Lys Leu Tyr Gln Pro Glu Pro Gln Val Gly Glu Glu Ser Trp Thr Asp
85 90 95 

Thr Asp Gly Thr Asp Glu Lys Tyr Gly Gly Arg Ala Leu Lys Pro Glu 
100 105 110 

Thr Lys Met Lys Pro Cys Tyr Gly Ser Phe Ala Lys Pro Thr Asn Val 
115 120 125

Lys Gly Gly Gln Ala Lys Val Lys Lys Val Glu Glu Gly Lys Val Glu
130 135 140

Tyr Asp Ile Asp Met Asn Phe Phe Asp Leu Arg Ser Gln Lys Thr Gly
145 150 155 160

Leu Lys Pro Lys Ile Val Met Tyr Ala Glu Asn Val Asp Leu Glu Thr
165 170 175 



Pro Asp Thr His Val Val Tyr Lys Pro Gly Ala Ser Asp Ala Ser Ser
180 185 190

His Ala Asn Leu Gly Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile
195 200 205



Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly
210 215 220

Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val
225 230 235 240

Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp
245 250 255

Ser Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val
260 265 270

Asp Ser Tyr Asp Pro Asp Val Arg Val Ile Glu Asn His Gly Val Glu
275 280 285

Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Val Gly Pro Arg 
290 295 300 

Thr Asp Ser Tyr Lys Gly Ile Glu Thr Asn Gly Asp Glu Asn Thr Thr
305 310 315 320

Trp Lys Asp Leu Asp Pro Asn Gly Ile Ser Glu Leu Ala Lys Gly Asn
325 330 335 

Pro Phe 



<210> 14 
<211> 315 
<212> PRT 

<213> chimpanzee adenovirus Pan-9
<400> 14 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp
1 5 10 15

Gly Glu Thr Ala Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val 
20 25 30 

Gln Gly Ile Asn Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr
35 40 45

Asp Asp Gln Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln
50 55 60

Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr
65 70 75 80 

Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 95 

Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys
100 105 110

Thr Gly Thr Gly Thr Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe
115 120 125

Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu
130 135 140

Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr
145 150 155 160

Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln
165 170 175

Ala Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile
180 185 190

Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly
195 200 205

Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr
210 215 220 






Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg
225 230 235 240

Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val
245 250 255

Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys
260 265 270

Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys
275 280 285

Ala Asn Gly Thr Asp Gln Thr Thr Trp Thr Lys Asp Asp Ser Val Asn
290 295 300

Asp Ala Asn Glu Ile Gly Lys Gly Asn Pro Phe
305 310 315



<210> 15 
<211> 315 
<212> PRT 

<213> chimpanzee adenovirus Pan-5 
<400> 15 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Asp
1 5 10 15

Gly Asp Thr Gly Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val
20 25 30

Gln Gly Ile Ser Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Thr
35 40 45

Asp Asp Gln Pro Ile Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln
50 55 60

Val Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr
65 70 75 80 

Gly Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly 
85 90 95 

Ser Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys
100 105 110

Thr Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe
115 120 125

Asp Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu
130 135 140

Tyr Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr
145 150 155 160 
Lys Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln
165 170 175

Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe
180 185 190

Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala
195 200 205

Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn
210 215 220

Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr
225 230 235

Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp
245 250 255

Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr
260 265 270

Phe Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile
275 280 285

Ala Asn Gly Ala Asp Gln Thr Thr Trp Thr Lys Asp Asp Thr Val
290 295 300

Asp Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe
305 310 315








<210> 16 
<211> 324 
<212> PRT 

<213> chimpanzee adenovirus Pan-6












<400> 16 
























Ala Pro Lys Gly Ala Pro Asn Ser Ser Gln Trp Glu Gln Ala Lys
1 5 10 15

Gly Asn Gly Gly Thr Met Glu Thr His Thr Tyr Gly Val Ala Pro
20 25 30

Gly Gly Glu Asn Ile Thr Lys Asp Gly Leu Gln Ile Gly Thr Asp
35 40 45

Thr Ala Asn Gln Asn Lys Pro Ile Tyr Ala Asp Lys Thr Phe Gln
50 55 60

Glu Pro Gln Val Gly Glu Glu Asn Trp Gln Glu Thr Glu Asn Phe
65 70 75 80

Gly Gly Arg Ala Leu Lys Lys Asp Thr Lys Met Lys Pro Cys Tyr
85 90 95

Ser Tyr Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Lys Leu Lys
100 105 110

Val Gly Asp Asp Gly Val Pro Thr Lys Glu Phe Asp Ile Asp Leu Ala
115 120 125

Phe Phe Asp Thr Pro Gly Gly Thr Val Asn Gly Gln Asp Glu Tyr Lys
130 135 140

Ala Asp Ile Val Met Tyr Thr Glu Asn Thr Tyr Leu Glu Thr Pro Asp
145 150 155 160

Thr His Val Val Tyr Lys Pro Gly Lys Asp Asp Ala Ser Ser Glu Ile
165 170 175

Asn Leu Val Gln Gln Ser Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe
180 185 190

Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met
195 200 205

Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu
210 215 220

Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu
225 230 235 240

Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser
245 250 255

Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu
260 265 270 

Leu Pro Asn Tyr Cys Phe Pro Leu Asp Gly Ser Gly Thr Asn Ala Ala 
275 280 285 

Tyr Gin Gly Val Lys Val Lys Asp Gly Gin Asp Gly Asp Val Glu Ser 
290 295 300 

Glu Trp Glu Asn Asp Asp Thr Val Ala Ala Arg Asn Gin Leu Cys Lys 
305 310 315 320 



Gly Asn He Phe 



<210> 17 

<211> 314 

<212> PRT 

<213> chimpanzee adenovirus Pan-7 

<400> 17 

Ala Pro Lys Gly Ala Pro Asn Thr Cys Gln Trp Thr Tyr Lys Ala Gly
1 5 10 15

Asp Thr Asp Thr Glu Lys Thr Tyr Thr Tyr Gly Asn Ala Pro Val Gln
20 25 30

Gly Ile Ser Ile Thr Lys Asp Gly Ile Gln Leu Gly Thr Asp Ser Asp
35 40 45

Gly Gln Ala Ile Tyr Ala Asp Glu Thr Tyr Gln Pro Glu Pro Gln Val
50 55 60

Gly Asp Ala Glu Trp His Asp Ile Thr Gly Thr Asp Glu Lys Tyr Gly
65 70 75 80

Gly Arg Ala Leu Lys Pro Asp Thr Lys Met Lys Pro Cys Tyr Gly Ser
85 90 95

Phe Ala Lys Pro Thr Asn Lys Glu Gly Gly Gln Ala Asn Val Lys Thr
100 105 110

Glu Thr Gly Gly Thr Lys Glu Tyr Asp Ile Asp Met Ala Phe Phe Asp
115 120 125

Asn Arg Ser Ala Ala Ala Ala Gly Leu Ala Pro Glu Ile Val Leu Tyr
130 135 140

Thr Glu Asn Val Asp Leu Glu Thr Pro Asp Thr His Ile Val Tyr Lys
145 150 155 160

Ala Gly Thr Asp Asp Ser Ser Ser Ser Ile Asn Leu Gly Gln Gln Ser
165 170 175

Met Pro Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly
180 185 190

Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln
195 200 205

Ala Ser Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu
210 215 220

Leu Ser Tyr Gln Leu Leu Leu Asp Ser Leu Gly Asp Arg Thr Arg Tyr
225 230 235 240

Phe Ser Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg
245 250 255

Ile Ile Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe
260 265 270

Pro Leu Asp Ala Val Gly Arg Thr Asp Thr Tyr Gln Gly Ile Lys Ala
275 280 285

Asn Gly Asp Asn Gln Thr Thr Trp Thr Lys Asp Asp Thr Val Asn Asp
290 295 300

Ala Asn Glu Leu Gly Lys Gly Asn Pro Phe
305 310
<210> 18

<211> 179 

<212> PRT 

<213> chimpanzee adenovirus Pan9 

<400> 18 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Ile Leu Ala
1 5 10 15

Glu Asn Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30

Ile Leu Ala Thr Val Ser Val Leu Val Val Gly Ser Gly Asn Leu Asn
35 40 45

Pro Ile Thr Gly Thr Val Ser Ser Ala Gln Val Phe Leu Arg Phe Asp
50 55 60

Ala Asn Gly Val Leu Leu Thr Glu His Ser Thr Leu Lys Lys Tyr Trp
65 70 75 80

Gly Tyr Arg Gln Gly Asp Ser Ile Asp Gly Thr Pro Tyr Thr Asn Ala
85 90 95

Val Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Ser Gln Ser Ser
100 105 110

Thr Thr Lys Asn Asn Ile Val Gly Gln Val Tyr Met Asn Gly Asp Val
115 120 125

Ser Lys Pro Met Leu Leu Thr Ile Thr Leu Asn Gly Thr Asp Asp Ser
130 135 140

Asn Ser Thr Tyr Ser Met Ser Phe Ser Tyr Thr Trp Thr Asn Gly Ser
145 150 155 160

Tyr Val Gly Ala Thr Phe Gly Ala Asn Ser Tyr Thr Phe Ser Tyr Ile
165 170 175

Ala Gln Glu



<210> 19 

<211> 185 

<212> PRT 

<213> chimpanzee adenovirus Pan6 

<400> 19 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Gln Leu Leu Ser
1 5 10 15

Asp Arg Asp Ala Lys Phe Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30

Ile Leu Gly Thr Val Ala Val Ala Ala Val Thr Val Gly Ser Ala Leu
35 40 45

Asn Pro Ile Asn Asp Thr Val Lys Ser Ala Ile Val Phe Leu Arg Phe
50 55 60

Asp Ser Asp Gly Val Leu Met Ser Asn Ser Ser Met Val Gly Asp Tyr
65 70 75 80

Trp Asn Phe Arg Glu Gly Gln Thr Thr Gln Ser Val Ala Tyr Thr Asn
85 90 95

Ala Val Gly Phe Met Pro Asn Ile Gly Ala Tyr Pro Lys Thr Gln Ser
100 105 110

Lys Thr Pro Lys Asn Ser Ile Val Ser Gln Val Tyr Leu Thr Gly Glu
115 120 125

Thr Thr Met Pro Met Thr Leu Thr Ile Thr Phe Asn Gly Thr Asp Glu
130 135 140

Lys Asp Thr Thr Pro Val Ser Thr Tyr Ser Met Thr Phe Thr Trp Gln
145 150 155 160

Trp Thr Gly Asp Tyr Lys Asp Lys Asn Ile Thr Phe Ala Thr Asn Ser
165 170 175

Phe Ser Phe Ser Tyr Ile Ala Gln Glu
180 185



<210> 20 
<211> 179 
<212> PRT 

<213> chimpanzee adenovirus Pan7 
<400> 20 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Lys Ile Tyr Ser
1 5 10 15

Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30

Ile Leu Gly Thr Val Thr Val Leu Ala Val Asn Asn Gly Ser Leu Asn
35 40 45

Pro Ile Thr Asn Thr Val Ser Thr Ala Leu Val Ser Leu Lys Phe Asp
50 55 60

Ala Ser Gly Val Leu Leu Ser Ser Ser Thr Leu Asp Lys Glu Tyr Trp
65 70 75 80

Asn Phe Arg Lys Gly Asp Val Thr Pro Ala Glu Pro Tyr Thr Asn Ala
85 90 95

Ile Gly Phe Met Pro Asn Ile Lys Ala Tyr Pro Lys Asn Thr Ser Ala
100 105 110

Ala Ser Lys Ser His Ile Val Ser Gln Val Tyr Leu Asn Gly Asp Glu
115 120 125

Ala Lys Pro Leu Met Leu Ile Ile Thr Phe Asn Glu Thr Glu Asp Ala
130 135 140

Thr Cys Thr Tyr Ser Ile Thr Phe Gln Trp Lys Trp Asp Ser Thr Lys
145 150 155 160

Tyr Thr Gly Glu Thr Leu Ala Thr Ser Ser Phe Thr Phe Ser Tyr Ile
165 170 175

Ala Gln Glu



<210> 21 
<211> 179 
<212> PRT 

<213> chimpanzee adenovirus Pan5 
<400> 21 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys His Ile Tyr Ser
1 5 10 15

Glu Lys Asp Ala Lys Leu Thr Leu Cys Leu Thr Lys Cys Gly Ser Gln
20 25 30

Ile Leu Gly Thr Val Ser Leu Ile Ala Val Asp Thr Gly Ser Leu Asn
35 40 45

Pro Ile Thr Gly Thr Val Thr Thr Ala Leu Val Ser Leu Lys Phe Asp
50 55 60

Ala Asn Gly Val Leu Gln Ser Ser Ser Thr Leu Asp Ser Asp Tyr Trp
65 70 75 80

Asn Phe Arg Gln Gly Asp Val Thr Pro Ala Glu Ala Tyr Thr Asn Ala
85 90 95

Ile Gly Phe Met Pro Asn Leu Lys Ala Tyr Pro Lys Asn Thr Ser Gly
100 105 110

Ala Ala Lys Ser His Ile Val Gly Lys Val Tyr Leu His Gly Asp Thr
115 120 125

Gly Lys Pro Leu Asp Leu Ile Ile Thr Phe Asn Glu Thr Ser Asp Glu
130 135 140

Ser Cys Thr Tyr Cys Ile Asn Phe Gln Trp Gln Trp Gly Ala Asp Gln
145 150 155 160

Tyr Lys Asn Glu Thr Leu Ala Val Ser Ser Phe Thr Phe Ser Tyr Ile
165 170 175

Ala Lys Glu



<210> 22 

<211> 183 

<212> PRT 

<213> human adenovirus Ad 2 

<400> 22 

Thr Leu Trp Thr Thr Pro Asp Pro Ser Pro Asn Cys Arg Ile His Ser
1 5 10 15

Asp Asn Asp Cys Lys Phe Thr Leu Val Leu Thr Lys Cys Gly Ser Gln
20 25 30

Val Leu Ala Thr Val Ala Ala Leu Ala Val Ser Gly Asp Leu Ser Ser
35 40 45

Met Thr Gly Thr Val Ala Ser Val Ser Ile Phe Leu Arg Phe Asp Gln
50 55 60

Asn Gly Val Leu Met Glu Asn Ser Ser Leu Lys Lys His Tyr Trp Asn
65 70 75 80

Phe Arg Asn Gly Asn Ser Thr Asn Ala Asn Pro Tyr Thr Asn Ala Val
85 90 95

Gly Phe Met Pro Asn Leu Leu Ala Tyr Pro Lys Thr Gln Ser Gln Thr
100 105 110

Ala Lys Asn Asn Ile Val Ser Gln Val Tyr Leu His Gly Asp Lys Thr
115 120 125

Lys Pro Met Ile Leu Thr Ile Thr Leu Asn Gly Thr Ser Glu Ser Thr
130 135 140

Glu Thr Ser Glu Val Ser Thr Tyr Ser Met Ser Phe Thr Trp Ser Trp
145 150 155 160

Glu Ser Gly Lys Tyr Thr Thr Glu Thr Phe Ala Thr Asn Ser Tyr Thr
165 170 175

Phe Ser Tyr Ile Ala Gln Glu
180

<210> 23 
<211> 182 
<212> PRT 

<213> human adenovirus Ad 5 
<400> 23 

Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala
1 5 10 15

Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gln
20 25 30

Ile Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro
35 40 45

Ile Ser Gly Thr Val Gln Ser Ala His Leu Ile Ile Arg Phe Asp Glu
50 55 60

Asn Gly Val Leu Ile Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn
65 70 75 80

Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val
85 90 95

Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr
100 105 110

Ala Lys Ser Asn Ile Val Ser Gln Val Tyr Leu Asn Gly Asp Lys Thr
115 120 125

Lys Pro Val Thr Leu Thr Ile Thr Leu Asn Gly Thr Gln Glu Thr Gly
130 135 140

Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser
145 150 155 160

Gly His Asn Tyr Ile Asn Glu Ile Phe Ala Thr Ser Ser Tyr Thr Glu
165 170 175

Ser Tyr Ile Ala Gln Glu
180



<210> 24 

<211> 34264 

<212> DNA 

<213> simian adenovirus SV-1 

<220>

<221> CDS

<222> (12454)..(13965)

<223> L2 Penton

<220>

<221> CDS

<222> (16841)..(19636)

<223> L3 Hexon

<220>

<221> CDS

<222> (28059)..(29150)

<223> L5 Fiber #2

<220>

<221> CDS

<222> (29183)..(30865)

<223> L5 Fiber #1
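
As a quick consistency check on the feature table for SEQ ID NO: 24, each CDS range in the <222> fields can be tested for a length that is a whole number of codons. The sketch below is illustrative only and is not part of the listing: the dictionary `CDS_FEATURES` and the function `cds_summary` are names assumed here, and the coordinates are copied from the <222> fields and assumed to be 1-based and inclusive.

```python
# Illustrative check only: coordinates are copied from the <222> fields of
# SEQ ID NO: 24 and are assumed to be 1-based and inclusive.
CDS_FEATURES = {
    "L2 Penton": (12454, 13965),
    "L3 Hexon": (16841, 19636),
    "L5 Fiber #2": (28059, 29150),
    "L5 Fiber #1": (29183, 30865),
}


def cds_summary(start, end):
    """Return (length in nucleotides, whole codons, leftover nucleotides)."""
    length = end - start + 1
    codons, remainder = divmod(length, 3)
    return length, codons, remainder


if __name__ == "__main__":
    for name, (start, end) in CDS_FEATURES.items():
        length, codons, remainder = cds_summary(start, end)
        print(f"{name}: {length} nt, {codons} codons, remainder {remainder}")
```

Under these assumptions, every listed range has a length divisible by three. The L2 Penton range, for example, spans 1512 nucleotides (504 codons), which would correspond to 503 residues plus a stop codon if the range includes the stop.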

<400> 24 

tccttattct ggaaacgtgc caatatgata atgagcgggg aggagcgagg cggggccggg 60 

gtgacgtgcg gtgacgtggg gtgacgcggg gtggcgcgag ggcggggcgg gagtggggag 120

gcgcttagtt tttacgtatg cggaaggagg ttttataccg gaagttgggt aatttgggcg 180 

tatacttgta agttttgtgt aatttggcgc gaaaaccggg taatgaggaa gttgaggtta 240 

atatgtactt tttatgactg ggcggaattt ctgctgatca gcagtgaact ttgggcgctg 300 

acggggaggt ttcgctacgt ggcagtacca cgagaaggct caaaggtccc atttattgta 360 

ctcctcagog ttttcgctgg gtatttaaac gctgtcagat catcaagagg ccactcttga 420 

gtgccggcga gtagagtttt ctcctccgcg ctgccgcgat gaggctggtt cccgagatgt 480 

acggtgtttt ctgcagcgag acggcccgga actcagatga gctgcttaat acagatctgc 540

tggatgttcc caactcgoct gtggcttcgc ctccgtcgct tcatgatctt ttcgatgtgg 600 

aagtggatcc accgcaagat cccaacgagg acgcggtaaa cagtatgttc cctgaatgtc 660 

tgtttgaggc ggctgaggag ggttctcaca gcagtgaaga gagcagacgg ggagaggaac 720 

tggacttgaa atgctacgag gaatgtctgc cttctagcga ttctgaaacg gaacagacag 780 

ggggagacgg ctgtgagtcg gcaatgaaaa atgaacttgt attagactgt ccagaacatc 840 

ctggtcatgg ctgccgtgcc tgtgcttttc atagaaatgc cagcggaaat cctgagactc 900 

tatgtgctct gtgttatctg cgccttacca gcgattttgt atacagtaag taaagtgttt 960 

tcattggcgt acggtagggg attcgttgaa gtgctttgtg acttattatg tgtcattatt 1020 

tctaggtgac gtgtccgacg tggaagggga aggagataga tcaggggctg ctaattctcc 1080 

ttgcactttg ggggctgtgg ttccagttgg catttttaaa ccgagtggtg gaggagaacg 1140 

agccggagga gaccgagaat ctgagagccg gcctggaccc tccagtggaa gactaggtgc 1200 

tgaggatgat cctgaagagg ggactagtgg gggtgctagg aaaaagcaaa aaactgagcc 1260 

tgaacctaga aactttttga atgagttgac tgtaagccta atgaatcggc agcgtcctga 1320 

gacggtgttt tggactgagt tggaggatga gttcaagaag ggggaattaa acctcttgta 1380 

caagtatggg tttgagcagt tgaaaactca ctggttggag ccgtgggagg atatggaaat 1440 

ggctctagac acctttgcta aagtggctct gcggccggat aaagtttaca ctattcgccg 1500 
gactcggccg cgtgtccctt 4320

ggcgcgcagc tttcccttgg aaacgtgctg acatttggtg cagtgcagac acttgagggc 4380

gtagagtttt ggggccagga agaccgactc gggcgagtag gcgtcggctc cgcactgagc 4440

gcagacggtc tcgcactcca ccagccacgt gagctcgggt ttagcgggat caaaaaccaa 4500

gttgcctcca ttttttttga tgcgtttctt accttgcgtc tccatgagtc tgtgtcccgc 4560

ttccgtgaca aaaaggctgt cggtatcccc gtagaccgac ttgagggggc gatcttccaa 4620

aggtgttccg aggtcttccg cgtacaggaa ctgggaccac tccgagacaa aggctcgggt 4680

ccaggctaac acgaaggagg cgatctgcga ggggtatctg tcgttttcaa tgagggggtc 4740

caccttttcc agggtgtgca gacacaggtc gtcctcctcc gcgtccacga aggtgattgg 4800

cttgtaagtg taggtcacgt gacccgcacc cccccaaggg gtataaaagg gggcgtgccc 4860

actctccccg tcactttctt ccgcatcgct gtggaccaga gccagctgtt cgggtgagta 4920
ggccctctca


aaagccggca 


tgatttcggc 


get caagttg 


ucaguutcua 


ca a a cga ggt. 


a q ft n 

ft z> o U 


ggatttgata 


ttcacgtgcc 


ccgcggcgat 


gcttttgatg 


gtggaggggt 


cca uccga L.C 




agaaaacacg 


_4__,4_j i_ 4- 4— — 4_ 

atctttttat 


tgtcaagttt 


ggt ggcgaaa 


gaecegtaga 


ggg cgttgga 


ci on 


aagcaacttg 


gcgatggagc 


gcagggtctg 


_4_4_4_4-4_,_,4-._,_ 

atttttctcc 


cgat cggccc 


t ct ccttggc 




ggcgatgttg 


agttgcacgt 


actcgcgggc 


cacg caccgc 


ca ct egggga 


acacggcggi. 


coon 

ozz u 


gcgctcgtcg 


ggcaggatgc 


gcacgcgcca 


gccgcggttg 


tgcagggtga 


tgaggtccac 


coon 


gctggtggcc 


acct ccccgc 


ggaggggct c 


gttggt ccaa 


cacaa ucgee 


/-* si /—> r~ t4* , 4~4~4—/~ «4— 
CCCCLLULCL 




ggagcagaac 


ggagg caggg 


gatctagcaa 


gttggcgggc 




4— / — i rf^f *s 4~ r~f 4~ 

eg ucga ugg u 


c/nn 


aaatatgccg 


ggtagcagaa 


j_ 4_ 4- 4— _ 4_ 4 , — — , 

ttttattaaa 


ataat cgatt 


t cggtgt ccg 


tgt ct tgcaa 


c /(rn 
O 41 O vj 


cgcgtcttcc 


cacttcttca 


ccgc caggg c 


ectttegtag 


ggatt caggg 


gcggtcccca 


ccon 
D DZ U 


gggcatgggg 


tgggt caggg 


ccgaggcgta 


catgccgcag 


atgtcgtaca 


eg tacagggg 


ccpn 
jjOU 


ct ccctcaac 


accccgatgt 


aagtggggta 


acagcgcccc 


ccgcgyatgc 


4- f-i 4~ /- • /*t f* a 

uggc ucgcdL. 


JDIU 


gtagtcgtac 


atctcgtgag 


agggagccat 


gagecegt ct 


cccaagtggg 


t cttgt gggg 


conn 
J / uu 


4— 4— 4- -I— 4— _ _ _ _ _ 

tttttcggcc 


cggtagagga 


tctgcctgaa 


gatggcgtgg 


gagttggaag 


a gat agtggg 


"7 ^ n 
o / o u 


gcgttggaag 


acgttaaagt 


tggct ccggg 


cagtcccacg 


gagt ctt gga 


tgaactgggc 


coon 


gtaggattcc 


cggagcttgt 


ccaccagggc 


t gcggtt a cc 


ageaegt cga 


gagegcagta 


^ p p n 


gtccaacgtc 


tcgcg gacca 


ggtt gtaggc 


cgt ct cttgt 


4-4-4-4-4- _4-___ 


acagti-cgcg 


CQ/in 
O u 


attgaggagg 


tatt cctcgc 


ggt ctttcca 


gtactcttcg 


gcgggaaat c 


^— 4-- ^— 4— /«r 4-* /— < 

ctuiT-tcguc 


DUUU 


cgct cggtaa 


gaacctaaca 


tgtaaaattc 


guucacggcu 


4— 4— 4— --\ 4— ^ 

t ugtaLggac 


aacagccuL-L. 


o u o u 


tt ctaccggc 


agggcgtacg 


cttgagcggc 


, — « 4— 4— 4— /—i 4— —n 

uttLCtgaga 


gagguguggg 


uy ay y y llj da 


CI Of) 
D J_ U 


ggtgt cccgc 


aCCdLCaCLU 


tcaggtacLy 


augT-.uL.gaay 


4— — — .~.4— « 4- /i«"f- 

LL-LLj u y Luy u 


<— y Lay y oy 




ctgttcccac 


agcgtgaagt 


cggt gcgctt 


4*" 4— 4— /— • 4— *^ i— 1 4— 
LLLCLyCCLg 


rf rr -n 4~" 4~ /~f s~f t~t /~f d 

ggauL-gygga 


y y y uy aa uy l. 


VJ Z. *± VJ 


gacgtcgtta 


aagaggattt 


tcccggcgcg 


gggcatgaag 


ttgegagaga 


4~ 4 — /-*r "» /^r /-w 

Lcctgaaggg 


(conn 


tccgggcacg 


tccgagcggt 


tgtt ga tgac 


ttgcgccgcc 


aggacgaucu 


y — • y^f4~ /— ' ^1 l~f t~* 

CyLCydayuu 


DOOU 


gttgatgttg 


tggcccacga 


tgtaaagttc 


ga taaagege 


ggctgt ccct 


tgagggcegg 


O 4i Z U 


cgcttttttc aactcctcgt aggtgagaca gtccggcgag gagagaccca gctccgcccg 6480

ggcccagtcg gagagctgag ggttagccgc gaggaaagag ctccacaggt caagggctag 6540

cagagtttgc aagcggtcgc ggaactcgcg aaactttttc cccacggcca ttttctccgg 6600

cgtcaccacg tagaaagtgc aggggcggtc gttccagacg tcccatcgga gctctagggc 6660

cagctcgcag gcttgacgaa cgagggtctc ctcgcccgag acgtgcatga ccagcatgaa 6720

gggtaccaac tgtttcccga acgagcccat ccatgtgtag gtttctacgt cgtaggtgac 6780

aaagagccgc tgggtgcgcg cgtgggagcc gatcgggaag aagctgatct cctgccacca 6840

gttggaggaa tgggtgfctga tgtggtgaaa gtagaagtcc cgccggcgca cagagcattc 6900

gtgctgatgt ttgtaaaagc gaccgcagta gtcgcagcgc tgcacgctct gtatctcctg 6960

aatgagatgc gcttttcgcc cgcgcaccag aaaccggagg gggaagttga gacgggggct 7020

tggtggggcg gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc cctccttctc 7080

tgggtggacg acggtgggga cgacgacgcc ccgggtgccg caagtccaga tctccgccac 7140

ggaggggcgc aggcgttgca ggaggggacg cagctgcccg ctgtccaggg agtcgagggc 7200

ggccgcgctg aggtcggcgg gaagcgtttg caagttcact ttcagaagac cggtaagagc 7260

gtgagccagg tgcacatggt acttgatttc caggggggtg ttggaagagg cgtccacggc 7320

gtagaggagg ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt ttatctcact 7380

cgtcgagggc gagcgccggg gggtagaggc ggctctgcgc cggggggcag cggaggcagt 7440

ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag cccggagact gctggcgtgg 7500

gcgacgacgc ggcggttgag gtcctggatg tgccgtctct gcgtgaagac caccggcccc 7560

cgggtcctga acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt aacggcggcc 7620

tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaac 7680

tgctccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740

tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800

cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg ggccacgttg 7860

agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag gtagttgagc 7920

gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg 7980

ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac ggcgaagttg 8040

aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100

gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160

ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220

cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc gccgcgccgg 8280

cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagacgccg 8340
/~1 f~* /->f g~t f—f f~~% f+f ~> y-Y

LCyCgCdyay 


f* ft f* f* ft f+ ft ft 4~~ ft 

eg occccy uy 


cagggagggt 


aagtggttag 


ggccgt eggg 


cagggacacg 


o 4 U U 


/t ft ft ft 4— — \ * <^-#- — > 


f /^r<a4-4-4-4*a4- 


caattgetge 


gt aggcact c 


cgtgcaggga 


tctgagaacg 


o4oU 


t cgaggtcga 


egggat ccga 


gaactt ct ct 


aggaaagegt 


ctatccaatc 


geagtegcaa 


8520 


ggtaag ctga 


ggacggtggg 


ccgctggggg 


gcgtccgcgg 


gcagttggga 


ggtgatgctg 


858 0 


ctga.tga.tgt 


aattaaagta 


ggcggt cttc 


aggeggegga 


tggtggcgag 


gaggaccacg 


8 64 0 


tc u l. ugggcc 


eggect gtt g 


aat gegcagg 


eg ctcggcca 


tgccccagg c 


ctcg ctctga 


o "~7 r\ r\ 


<""< 3 ft r"« /^T =i ' ft ft a 


ftffi~ f 1 4~ 4~ 4— ft +~ ^ 

y y Lccccy La 


ST 4— ^ s~*C s**i 4~» 4" y**^ 

guaguctugc 


a tz. c a gn. c u c tz 


ccaccggaac 


ct ct gctt ct 


o / bU 


i"* < ^ ft /— < 4— <^f4— 4— ft 

cccCLyucty 


ecaugegagu 


cgagccgaac 


ccc cgcaggg 


getgeagcaa 


egctaggteg 


882 0 


gccacgaccc 


tctcggccag 


cacggcctgt 


tggatctgcg 


tgagggtggt 


ctggaagtcg 


8 8 8 0 


tccaggtcca 


egaageggtg 


ataggccccc 


gtgttgatgg 


tgtaggtgca 


gttggccatg 


8940 


s~1 ft ft S /"» f • ^ ^"c4— 

aCyyaCCayL 


4— — i r—* /— < —» , — < 4-* 4— 

uyacgacLug 


catgeegggt 


t gggtgat ct 


ccgtgtactt 


gaggegegag 


n n n a 


taggcgcggg 


act cgaacac 


gtagtcgttg 


ca tgtgcgta 


ccagatactg 


gtagccaacc 


9060 


aggaagtggg 


gaggeggtte 


teggtacagg 


ggccagccga 


ctgtggcggg 


ggcgccgggg 


912 0 


/"*■ ^ ^ ft ff\ m ft ft 4— 

yaCdygi-Cy u 


ccagcatgag 


gcgatggtag 


tggtagatgt 


agegggagag 


ccaggtgatg 


918 0 


ccggccgagg 


tggtcgegge 


cctggtgaat 


t cgcggacgc 


ggtt ccagat 


gttgegcagg 


924 0 


ft ft ft ft ft -n /-r /-i 

yyycgaaay c 


gctccdLggu 


gggcacgctc 


tgccccgtga 


ggegggegea 


atcttgtacg 


9300 


ft 4— /—•■}— a 4- ft ft 

L- lu Lcty q uy y 


addaadyaCa 


ft ft ft ft ft ft 4" ft 4** 

gggcggLcai- 


cgactccctt 


c eg t age teg 


gggggtaaag 




LCyCaayyy l 


geggeggegg 


ggaaccccgg 


ttcgagaccg 


geeggat ccg 


ccgctcccga 


942 0 


L.y L.L Ly y L 


/-» /— • ft ft ~-> 4— f* s 
LULy d U C ci 


cgacgtccgc 


g Tzcgagaccc 


agccgcgacg 


ctccgcccca 


y 4 o u 


;ss 4— o /~< ft ft ft ft ft 

a Lduy y ay y y 


ft /t+~ /~«4— 4~ 4~ 4— ft 

y ay lull. i_i_y 


/-T4-rr4-4-4-4-4-4-/- 


gtagatgeat 


ccggtgctgc 


ggcagatgcg 


y o4 u 


a i™< f* T~ m <— ♦ <T 

cill. uLay auy 


ft ft ft ^ y— « \ 


ccgccgcggc 


ggcagtaaac 


ctgageggag 


gcggtgacag 


y bu u 


nrfpi rtrf^i actP\ n 


pr izi < — > 4- ff ft r*« 4- 4- 
y cty u uyyuL t_ 


4~ a ft ^ f f* 4~ ft ft =3 

L-ctyctoi—uyyct 


2» a <-r /^r -a ft -rs ft 

dydgggagag 


ft ft ft ft 4— ft ft ft ft ft 

yggcuggccc 


ggcugggagc 


yoou 


y l uy l. (— o c_ i— a. 


ydgagdCdCC 


cu.agggT-.izca 


get cgtgagg 


gacgccaggc 


aggcttttgt 


y /2 u 


ft ft ft fr -> — i /-^r f— < -3 /— f 

y Luyday Ldy 


o ;a ft ft ft 4" 4" 4- 
aaCCUy LCLd 


gggaccgcag 


eggtcaggag 


geggaggaga 


tgcgcgattg 


Q T o n 

y / o u 


y— * —j /-v <-y 4- 4* 4~ /T 

cagy LLLCyg 


gegggtagag 


agctgagggc 


gggcttcga t 




tcctgagggc 


9840 


ggaggatttc gagcccgacg agcgttctgg ggtgagcccg gcccgcgctc acgtctcggc 9900

ggccaacctg gtgagcgcgt acgagcagac ggtgaacgag gagcgcaact tccaaaagag 9960

ctttaacaat cacgtgagga ccctgatcgc gagggaggag gtgaccatcg ggctgatgca 10020

tctgtgggac ttcgtggagg cctacgtgca gaacccggcc agcaaacctc tgacggccca 10080

gctgttcctg atcgtgcagc acagccgcga caacgagacg ttccgcgacg ccatgttgaa 10140

catcgcggag cccgagggtc gctggctctt ggatctgatt aacatcctgc agagcatcgt 10200

ggtgcaggag aggggcctca gcttagcgga caaggtggcg gccattaact attcgatgca 10260

gagcctgggg aagttctacg ctcgcaagat ctacaagagc ccttacgtgc ccatagacaa 10320

ggaggtgaag atagacagct tttacatgcg catggcgctg aaggtgctga cgctgagcga 10380

cgacctcggc gtgtaccgta acgacaagat ccacaaggcg gtgagcgcca gccgccggcg 10440

ggagctgagc gacagggagc tgatgcacag cctgcagagg gcgctggcgg gcgccgggga 10500

cgaggagcgc gaggcttact tcgacatggg agccgatctg cagtggcgtc ccagcgcgcg 10560

cgccttggag gcggcgggct accccgacga ggaggatcgg gacgatttgg aggaggcagg 10620

cgagtacgag gacgaagcct gaccgggcag gtgttgtttt agatgcagcg gccggcggac 10680

ggggccaccg cggatcccgc acttttggca tccatgcaga gtcaaccttc gggcgtgacc 10740

gcctccgatg actgggcggc ggccatggac cgcattatgg cgctgactac ccgcaacccc 10800

gaggctttta gacagcaacc ccaggccaac cgtttttcgg ccatcttgga agcggtggtg 10860

ccctcccgca ccaaccccac acacgagaaa gtcctgacta tcgtgaacgc cctggtagac 10920

agcaaggcca tccgccgcga cgaggcgggc ttgatttaca acgctctgct ggaacgggtg 10980

gcgcgctaca acagcactaa cgttcagacc aatctggatc gcctcaccac cgacgtgaag 11040

gaggcgctgg ctcagaagga gcggtttctg agggacagca atctgggctc tctggtggca 11100

ctcaacgcct tcctgagcac gcagccggcc aacgtgcccc gcgggcagga ggactacgtg 11160

agcttcatca gcgctctgag gctgctggtg tccgaggtgc cccagagcga ggtgtatcag 11220

tctgggccgg attacttctt ccagacgtcc cgacagggct tgcaaacggt gaacctgact 11280

caggccttta aaaacttgca aggcatgtgg ggcgttaagg ccccggtggg cgatcgagcc 11340

accatctcca gtctgctgac ccccaacact cgcctgctgc tgctcttgat cgcgccgttc 11400

accaacagta gcactatcag ccgtgactcg tacctgggtc atctcatcac tttgtaccgc 11460

gaggccatcg gtcaggctca gatcgacgag cacacatatc aggagatcac taacgtgagc 11520

cgggccctgg gtcaggaaga taccggcagc ctggaagcca cgttgaactt tttgctaacc 11580

aaccggaggc aaaaaatacc ctcccagttt acgttaagcg ccgaggagga gaggattctg 11640

cgatacgtgc agcagtccgt gagtctgtac ttgatgcggg agggcgccac cgcttccacg 11700

gctttagaca tgacggctcg gaacatggaa ccgtcctttt actccgccca ccggccgttc 11760
attaaccgtc tgatggacta cttccatcgc gcggccgcca tgaacgggga gtacttcacc 11820

aatgccatcc tgaatccgca ttggatgccc ccgtccggct tctacaccgg cgagtttgac 11880 

ctgcccgaag ccgacgacgg ctttctttgg gacgacgtgt ccgacagcat tttcacgccg 11940 

ggcaatcgcc gattccagaa gaaggagggc ggagacgagc tccccctctc cagcgtggag 12000 

gcggcctcta ggggagagag tccctttccc agtctgtctt ccgccagcag tggtcgggta 12060 

acgcgcccgc ggttgccggg ggagagcgac tacctgaacg accccttgct gcggccggct 12120 

aggaagaaaa atttccccaa caacggggtg gaaagcttgg tggataaaat gaatcgttgg 12180 

aagacctacg cccaggagca gcgggagtgg gaggacagtc agccgcgacc gctggttccg 12240 

ccgcactggc gtcgtcagag agaagacccg gacgactccg cagacgatag tagcgtgttg 12300 

gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg gcgttccagt 12360 

cgcctctact aataaaaaag acgcggaaac ttaccagagc catggccaca gcgtgtgtcc 12420 

tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg gtg aga gtc 12474 

Met Arg Arg Ala Val Arg Val 
1 5 

acg ccg gcg gcg tat gag ggt ccg ccc cct tct tac gaa agc gtg atg 12522
Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu Ser Val Met
10 15 20

gga tca gcg aac gtg ccg gcc acg ctg gag gcg cct tac gtt cct ccc 12570
Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr Val Pro Pro
25 30 35



aga tac ctg gga cct acg gag ggc aga aac agc atc cgt tac tcc gag 12618
Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser Ile Arg Tyr Ser Glu
40 45 50 55

ctg gca ccc ctg tac gat acc acc aag gtg tac ctg gtg gac aac aag 12666
Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val Asp Asn Lys
60 65 70

tcg gcg gac atc gcc tcc ctg aat tat caa aac gat cac agc aat ttt 12714
Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His Ser Asn Phe
75 80 85

ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg gag gcg ggc 12762
Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr Glu Ala Gly
90 95 100

acg cag acc att aac ttt gac gag cgt tcc cgc tgg ggc ggt cag ctg 12810
Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly Gly Gln Leu
105 110 115

aaa acc atc ctg cac acc aac atg ccc aac atc aac gag ttc atg tcc 12858
Lys Thr Ile Leu His Thr Asn Met Pro Asn Ile Asn Glu Phe Met Ser
120 125 130 135
acc aac aag ttc agg gcc agg ctg atg gtt aaa aag gct gaa aac cag 12906
Thr Asn Lys Phe Arg Ala Arg Leu Met Val Lys Lys Ala Glu Asn Gln
140 145 150

cct ccc gag tac gaa tgg ttt gag ttc acc att ccc gag ggc aac tat 12954
Pro Pro Glu Tyr Glu Trp Phe Glu Phe Thr Ile Pro Glu Gly Asn Tyr
155 160 165

tcc gag acc atg act atc gat ctg atg aac aat gcg atc gtg gac aat 13002
Ser Glu Thr Met Thr Ile Asp Leu Met Asn Asn Ala Ile Val Asp Asn
170 175 180

tac ctg caa gtg ggg agg cag aac ggg gta ttg gaa agc gat atc ggc 13050
Tyr Leu Gln Val Gly Arg Gln Asn Gly Val Leu Glu Ser Asp Ile Gly
185 190 195

gta aaa ttt gat acc aga aac ttc cga ctg ggg tgg gat ccc gtg acc 13098
Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Trp Asp Pro Val Thr
200 205 210 215

aag ctg gtg atg cca ggc gtg tac acc aac gag gct ttt cac ccc gac 13146
Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp
220 225 230

atc gtg ctg ctg ccg ggg tgc ggt gtg gac ttc act cag agc cgt ttg 13194
Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe Thr Gln Ser Arg Leu
235 240 245

agt aac ctg tta ggg atc aga aag cgc cgc ccc ttc caa gag ggc ttt 13242
Ser Asn Leu Leu Gly Ile Arg Lys Arg Arg Pro Phe Gln Glu Gly Phe
250 255 260

cag atc atg tat gag gac ctg gaa gga ggt aac att cca ggt ttg cta 13290
Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn Ile Pro Gly Leu Leu
265 270 275

gac gtg ccg gcg tat gaa gag agt gtt aaa cag gcg gag gcg cag gga 13338
Asp Val Pro Ala Tyr Glu Glu Ser Val Lys Gln Ala Glu Ala Gln Gly
280 285 290 295

cga gag att cga ggc gac acc ttt gcc acg gaa cct cac gaa ctg gta 13386
Arg Glu Ile Arg Gly Asp Thr Phe Ala Thr Glu Pro His Glu Leu Val
300 305 310

ata aaa cct ctg gaa caa gac agt aaa aaa cgg agt tac aac att ata 13434
Ile Lys Pro Leu Glu Gln Asp Ser Lys Lys Arg Ser Tyr Asn Ile Ile
315 320 325

tcc ggc act atg aat acc ttg tac cgg agc tgg ttt ctg gct tac aac 13482
Ser Gly Thr Met Asn Thr Leu Tyr Arg Ser Trp Phe Leu Ala Tyr Asn
330 335 340

tac ggg gat ccc gaa aag gga gtg aga tca tgg acc ata ctc acc acc 13530
Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp Thr Ile Leu Thr Thr
345 350 355

acg gac gtg acc tgc ggc tcg cag caa gtg tac tgg tcc ctg ccg gat 13578
Thr Asp Val Thr Cys Gly Ser Gln Gln Val Tyr Trp Ser Leu Pro Asp
360 365 370 375

atg atg caa gac ccg gtc acc ttc cgc ccc tcc acc caa gtc agc aac 13626
Met Met Gln Asp Pro Val Thr Phe Arg Pro Ser Thr Gln Val Ser Asn
380 385 390

ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc cat gcc aag agc ttc 13674
Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val His Ala Lys Ser Phe
395 400 405

tac aac gaa cag gcc gtc tac tcg caa ctc att cgc cag tcc acc gcg 13722
Tyr Asn Glu Gln Ala Val Tyr Ser Gln Leu Ile Arg Gln Ser Thr Ala
410 415 420

ctt acc cac gtg ttc aat cgc ttt ccc gag aac cag att ctg gtg cgc 13770
Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gln Ile Leu Val Arg
425 430 435

cct ccc gct cct acc att acc acc gtc agt gaa aac gtt ccc gcc ctc 13818
Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu Asn Val Pro Ala Leu
440 445 450 455

aca gat cac gga acc ctg ccg ctg cgc agc agt atc agt gga gtt cag 13866
Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser Ile Ser Gly Val Gln
460 465 470

cgc gtg acc atc acc gac gcc aga cgt cga acc tgt ccc tac gtt tac 13914
Arg Val Thr Ile Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val Tyr
475 480 485

aaa gct ctt ggc gta gtg gct cct aaa gtg ctc tct agt cgc acc ttc 13962
Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe
490 495 500

taa acatgtccat cctcatctct cccgataaca acaccggctg gggactgggc 14015 

tccggcaaga tgtacggcgg agccaaaagg cgctccagtc agcacccagt tcgagttcgg 14075

ggccacttcc gtgctccctg gggagcttac aagcgaggac tctcgggccg aacggcggta 14135

gacgatacca tagatgccgt gattgccgac gcccgccggt acaaccccgg accggtcgct 14195

agcgccgcct ccaccgtgga ttccgtgatc gacagcgtgg tagctggcgc tcgggcctat 14255

gctcgccgca agaggcggct gcatcggaga cgtcgcccca ccgccgccat gctggcagcc 14315

agggccgtgc tgaggcgggc ccggagggta ggcagaaggg ctatgcgccg cgctgccgcc 14375

aacgccgccg ccgggagggc ccgccgacag gctgcccgcc aggctgctgc cgccatcgct 14435

agcatggcca gacccaggag agggaacgtg tactgggtgc gcgattctgt gacgggagtc 14495

cgagtgccgg tgcgcagccg acctccccga agttagaaga tccaagctgc gaagacggcg 14555

gtactgagtc tccctgttgt tatcagccca acatgagcaa gcgcaagttt aaagaagaac 14615 
tgctgcagac gctggtgcct gagatctatg gccctccgga cgtgaagcct gacattaagc 14675

cccgcgatat caagcgtgtt aaaaagcggg aaaagaaaga ggaactcgcg gtggtagacg 14735

atggcggagt ggaatttatt aggagtttcg ccccgcgacg cagggttcaa tggaaagggc 14795

ggcgggtaca acgcgttttg aggccgggca ccgcggtagt ttttaccccg ggagagcggt 14855

cggccgttag gggtttcaaa aggcagtacg acgaggtgta cggcgacgag gacatattgg 14915

aacaggcggc tcaacagatc ggagaatttg cctacggaaa gcgttcgcgt cgcgaagacc 14975

tggccatcgc tttagacagc ggcaacccca cgcccagcct caaacctgtg acgctgcagc 15035

aggtgctccc cgtgagcgcc agcacggaca gcaagagggg aataaaaaga gaaatggaag 15095

atctgcagcc caccatccag ctcatggtcc ctaaacggca gaggctggaa gaggtcctgg 15155

agaaaatgaa agtggaccca agcatagagc cggacgtcaa agtcaggccg atcaaagaag 15215

tggcccctgg tctcggggtg cagacggtgg atatccagat ccccgtcacg tcagcttcga 15275

ccgccgtgga agccatggaa acgcaaacgg aaacccctgc cgcgatcggt accagggaag 15335

tggcgttgca aaccgacccc tggtacgaat acgccgcccc tcggcgtcag aggcgacccg 15395

ctcgttacgg ccccgccaac gccatcatgc cagaatatgc gctgcatccg tctatcctgc 15455

ccacccccgg ctaccgggga gtgacgtatc gcccgtcagg aacccgccgc cgaacccgtc 15515

gccgccgccg ctcccgtcgt gctctggccc ccgtgtcggt gcgccgcgta acacgccggg 15575

gaaagacagt taccattccc aacccgcgct accaccctag catcctttaa tgactctgcc 15635

gttttgcaga tggctctgac ttgccgcgtg cgccttcccg ttccgcacta tcgaggaaga 15695

tctcgtcgta ggagaggcat ggcgggtagt ggtcgccggc gggctttgcg caggcgcatg 15755

aaaggcggaa ttttacccgc tctgataccc ataatcgccg ccgccatcgg tgccataccc 15815

ggcgtcgctt cagtggcctt gcaagcagct cgtaataaat aaacgaaggc ttttgcactt 15875

atgtcctggt cctgactatt ttatgcagaa agagcatgga agacatcaat tttacgtcgc 15935

tggctccgcg gcacggctcg cggccgctca tgggcacctg gaacgacatc ggcaccagtc 15995

agctcaacgg gggcgctttc aattggggga gcctttggag cggcattaaa aactttggct 16055

ccacgattaa atcctacggc agcaaagcct ggaacagtag tgctggtcag atgctccgag 16115

ataaactgaa ggacaccaac ttccaagaaa aagtggtcaa tggggtggtg accggcatcc 16175

acggcgcggt agatctcgcc aaccaagcgg tgcagaaaga gattgacagg cgtttggaaa 16235

gctcgcgggt gccgccgcag agaggggatg aggtggaggt cgaggaagta gaagtagagg 16295

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aaaagctgcc 


cccgctggag 


aaagttcccg 


gtgcgcctcc 


gagacegcag 


aagcgaccca 


16355 


ggccagaact agaagaaact ctggtgacgg agagcaagga gcctccctcg tacgagcaag 16415

ccttgaaaga gggcgcctct ccaccctacc caatgacaaa accgatcgcg cctatggctc 16475

ggccggtgta cgggaaggac tacaagcctg tcacgctaga gctccccccg ccgccaccgc 16535

cgccccccac gcgcccgacc gttccccccc ccctcfccaac tccgtcggcg crcra cccat rrt~ 16595

ccgcacccgt cgccgtgcct ctgccagccg cccgcccagt ggccgtggcc actgccagaa 16655

accccagagg ccagagagga gccaactggc aaagcacgct gaacagcatc gtgggcctgg 16715

gagtgaaaag cctgaaacgc cgccgttgct attattaaaa gtgtagctaa aaaatttccc 16775

gttgtatacg cctcctatgt taccgccaga gacgcgtgac tgtcgccgcg agcgccgctt 16835


tcaag atg gcc acc cca tcg atg atg ccg cag tgg tct tac atg cac atc 16885
Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile
505 510 515



gcc ggg cag gac gcc tcg gag tac ctg agc ccc ggt ctc gtg cag ttc 16933
Ala Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe
520 525 530

gcc cgc gcc acc gac acc tac ttc agc ttg gga aac aag ttt aga aac 16981
Ala Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn
535 540 545 550

ccc acc gtg gcc ccc acc cac gat gta acc acg gac cgc tcg caa agg 17029
Pro Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg
555 560 565

ctg acc ctg cgt ttt gtg ccc gta gac cgg gag gac acc gcg tac tct 17077
Leu Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser
570 575 580

tac aaa gtg cgc tac acg ctg gcc gta ggg gac aac cga gtg ctg gac 17125
Tyr Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp
585 590 595

atg gcc agc acc tac ttt gac atc cgg gga gtg ctg gat cgc ggt ccc 17173
Met Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro
600 605 610

agt ttt aag ccc tac tcg ggt acc gcg tac aat tcc ctg gct ccc aag 17221
Ser Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys
615 620 625 630

ggc gct ccc aac cct gca gaa tgg acg aat tca gac agc aaa gtt aaa 17269
Gly Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys
635 640 645

gtg agg gca cag gcg cct ttt gtt agc tcg tat ggt gct aca gcg att 17317
Val Arg Ala Gln Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala Ile
650 655 660

aca aaa gag ggt att cag gtg gga gta acc tta aca gac tcc gga tca 17365
Thr Lys Glu Gly Ile Gln Val Gly Val Thr Leu Thr Asp Ser Gly Ser
665 670 675

aca cca cag tat gca gat aaa acg tat cag cct gag ccg caa att gga 17413
Thr Pro Gln Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Ile Gly
680 685 690

gaa cta cag tgg aac agc gat gtt gga acc gat gac aaa ata gca gga 17461
Glu Leu Gln Trp Asn Ser Asp Val Gly Thr Asp Asp Lys Ile Ala Gly
695 700 705 710

aga gtg cta aag aaa aca acg ccc atg ttc cct tgt tac ggc tca tat 17509
Arg Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr
715 720 725

gcc agg ccc act aat gaa aaa gga gga cag gca aca ccg tcc gct agt 17557
Ala Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Thr Pro Ser Ala Ser
730 735 740

caa gac gtg caa aat ccc gaa tta caa ttt ttt gcc tct act aat gtc 17605
Gln Asp Val Gln Asn Pro Glu Leu Gln Phe Phe Ala Ser Thr Asn Val
745 750 755

gcc aat aca cca aaa gca gtt cta tat gcg gag gac gtg tca att gaa 17653
Ala Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser Ile Glu
760 765 770

gcg cca gac act cac ttg gtg ttc aaa cca aca gtc act gaa ggc att 17701
Ala Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly Ile
775 780 785 790

aca agt tca gag gct cta ctg acc caa caa gct gct ccc aac cgt cca 17749
Thr Ser Ser Glu Ala Leu Leu Thr Gln Gln Ala Ala Pro Asn Arg Pro
795 800 805

aac tac ata gcc ttt aga gat aat ttt att ggt ctc atg tac tac aat 17797
Asn Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn
810 815 820

agc aca ggt aac atg gga gta ctg gca ggc cag gct tct cag cta aat 17845
Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn
825 830 835

gca gtt gtt gac ctg caa gac aga aat act gag ctg tcc tac caa ctc 17893
Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu
840 845 850

atg ttg gac gcc ctc gga gac cgc agt cgg tac ttt tct atg tgg aac 17941
Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn
855 860 865 870

caa gct gtg gat agt tac gat cct gat gta aga atc ata gaa aac cat 17989
Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His
875 880 885

ggc gta gaa gat gaa ttg cct aat tat tgc ttt cct ttg gga ggc atg 18037
Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met
890 895 900

gca gta acc gac acc tac tcg cct ata aag gtt aat gga gga ggc aat 18085
Ala Val Thr Asp Thr Tyr Ser Pro Ile Lys Val Asn Gly Gly Gly Asn
905 910 915

gga tgg gaa gcc aat aac ggc gtt ttc acc gaa aga gga gtg gaa ata 18133
Gly Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu Ile
920 925 930

ggt tca ggg aac atg ttt gcc atg gag att aac ctg caa gcc aac cta 18181
Gly Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Gln Ala Asn Leu
935 940 945 950

tgg cgt agc ttt ctg tac tcc aat att ggg ctg tac ctg cca gac tct 18229
Trp Arg Ser Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser
955 960 965

ctc aaa atc act cct gac aac atc aca ctc cca gag aac aaa aac acc 18277
Leu Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr
970 975 980

tat cag tat atg aac ggt cgc gtg acg cca ccc ggg ctg gtt gac acc 18325
Tyr Gln Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr
985 990 995

tac gtt aac gtg ggc gcg cgc tgg tcc ccc gat gtc atg gac agt 18370
Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser
1000 1005 1010

att aac cct ttt aat cac cac cgc aac gcc gga ctc cgc tac cgt 18415
Ile Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg
1015 1020 1025

tcc atg ctc ctg gga aac gga cgc tac gtg ccc ttc cac atc cag 18460
Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln
1030 1035 1040

gtg ccc cag aaa ttc ttt gca att aaa aac ctg ctg ctg ctc ccc 18505
Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro
1045 1050 1055

ggt tcc tac acc tac gag tgg aac ttc cgc aag gac gtg aac atg 18550
Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met
1060 1065 1070

atc ttg cag agc tcg ctg ggc aat gac ctg cga gtg gac ggg gcc 18595
Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala
1075 1080 1085



agc atc cgc ttc gac agc atc aac ctg tac gcc aac ttt ttc ccc 18640
Ser Ile Arg Phe Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro
1090 1095 1100

atg gcc cac aac acg gcc tcc acc ctg gaa gcc atg ctg cgc aac 18685
Met Ala His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn
1105 1110 1115

gac acc aac gac caa tct ttc aac gac tac ctg tgc gcg gcc aac 18730
Asp Thr Asn Asp Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn
1120 1125 1130

atg ctg tac ccc atc ccc gcc aac gcc acc agc gtg ccc atc tcc 18775
Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser
1135 1140 1145

att ccc tct cgc aac tgg gca gcc ttc agg ggc tgg agt ttc acc 18820
Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr
1150 1155 1160

cgc ctc aaa acc aag gag acc ccc tcg ctg ggc tcc ggg ttc gac 18865
Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp
1165 1170 1175

ccc tac ttc gtc tac tcc ggc tcc atc ccc tac ctg gac ggc acc 18910
Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr
1180 1185 1190

ttc tac ctc aac cat act ttc aaa aag gtg tca atc atg ttc gac 18955
Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Met Phe Asp
1195 1200 1205

tcc tcc gtc agc tgg ccc ggc aac gac cgt ctg ctg acg ccc aac 19000
Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn
1210 1215 1220

gag ttc gaa atc aag cgt tcg gtg gac ggt gaa ggg tac aac gtg 19045
Glu Phe Glu Ile Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val
1225 1230 1235

gct cag agc aac atg acc aag gac tgg ttc ctg att cag atg ctc 19090
Ala Gln Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu
1240 1245 1250

agc cac tac aac atc ggc tac cag ggc ttc tac gtg ccc gaa aat 19135
Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Asn
1255 1260 1265

tac aag gac cgc atg tac tct ttc ttc aga aac ttc caa ccc atg 19180
Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met
1270 1275 1280

agc cgc caa att gta gat tca acg gct tac act aat tat cag gat 19225
Ser Arg Gln Ile Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gln Asp
1285 1290 1295

gtg aaa ctg cca tac cag cat aac aac tca ggg ttc gtg ggc tac 19270
Val Lys Leu Pro Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr
1300 1305 1310

atg gga ccc acc atg cga gag ggg cag gcc tac ccg gcc aac tat 19315
Met Gly Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr
1315 1320 1325

ccc tat ccc ctg att ggg gcc acc gcc gtg ccc agc ctc acg cag 19360
Pro Tyr Pro Leu Ile Gly Ala Thr Ala Val Pro Ser Leu Thr Gln
1330 1335 1340

aaa aag ttc ctc tgc gac cgg gtg atg tgg agg atc ccc ttc tct 19405
Lys Lys Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser
1345 1350 1355

agc aac ttc atg tct atg ggc tcc ctc acc gac ctg ggg cag aac 19450
Ser Asn Phe Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gln Asn
1360 1365 1370

atg ctg tac gcc aac tcc gct cac gcc ttg gat atg acc ttt gag 19495
Met Leu Tyr Ala Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu
1375 1380 1385

gtg gat ccc atg gat gag ccc acg ctt ctc tat gtt ctg ttt gaa 19540
Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu
1390 1395 1400

gtc ttc gac gtg gtg cgc atc cac cag ccg cac cgc ggc gtc atc 19585
Val Phe Asp Val Val Arg Ile His Gln Pro His Arg Gly Val Ile
1405 1410 1415

gag gcc gtc tac ctg cgc aca cct ttc tct gcc ggt aac gcc acc 19630
Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr
1420 1425 1430

acc taa agaagccgat gggctccagc gaacaggagc tgcaggccat tgttcgcgac 19686
Thr

ctgggctgcg ggccctactt tttgggcacc ttcgacaagc gttttcccgg cttcatgtcc 19746

ccccacaagc cggcctgtgc catcgttaac acggccggac gggagaccgg gggggtccac 19806

tggctcgcct tcgcctggaa cccgcgtaac cgcacctgct acctgttcga cccttttggt 19866 

ttctccgacg aaaggctgaa gcagatctac cagttcgagt acgaggggct cctcaagcgc 19926 

agcgctctgg cctccacgcc cgaccactgc gtcaccctgg aaaagtccac ccaaacggtc 19986

caggggcccc tctcggccgc ctgcgggctc ttctgttgca tgtttttgca cgccttcgtg 20046 

cactggcctc acacccccat ggatcacaac cccaccatgg atctgctcac cggagtgccc 20106 

aacagcatgc ttcacagccc ccaggtcgcc cccaccctgc gccgtaacca ggaacacctg 20166 

tatcgctttc tggggaaaca ctctgcctat tttcgccgcc accggcagcg catcgaacgg 20226

gccacggcct tcgaaagcat gagccaaaga gtgtaatcaa taaaaaacat ttttatttga 20286

catgatacgc gcttctggcg ttttattaaa aatcgaaggg ttcgagggag gggtcctcgt 20346

gcccgctggg gagggacacg ttgcgatact ggaaacgggc gctccaacga aactcgggga 20406

tcaccagccg cggcaggggc acgtcttcta ggttctgctt ccaaaactgc cgcaccagct 20466

gcagggctcc catgacgtcg ggcgccgata tcttgaagtc gcagttaggg ccggagctcc 20526 

cgcggctgtt gcggaacacg gggttggcac actggaacac cagcacgccg gggttgtgga 20586 

tactggccag ggccgtcggg tcggtcacct ccgacgcatc cagatcctcg gcgttgctca 20646 

gggcaaacgg ggtcagcttg cacatctgcc gcccaatctg gggtactagg tcgcgcttgt 20706

tgaggcagtc gcagcgcaga gggatcagga tgcgtcgctg cccgcgttgc atgatagggt 20766 

aactcgccgc caggaactcc tccatttgac ggaaggccat ctgggctttg ccgccctcgg 20826 

tgtagaatag cccgcaggac ttgctagaga atacgttatg accgcagttg acgtcctccg 20886 

cgcagcagcg ggcgtcttcg ttcttcagct gaaccacgtt gcggccccaa cggttctgga 20946 

ccaccttggc tctagtgggg tgctccttca gcgcccgctg tccgttctcg ctggttacat 21006 

ccatttccaa cacgtgctcc ttgcagacca tctccactcc gtggaagcaa aacaggacgc 21066 

cctcctgctg ggtactgcga tgctcccata cggcgcatcc ggtgggctcc cagctcttgt 21126 

gttttacccc cgcgtaggct tccatgtaag ccataaggaa tctgcccatc agctcggtga 21186 

aggtcttctg gttggtgaag gttagcggca ggccgcggtg ctcctcgttc aaccaagttt 21246 

gacagatctt gcggtacacc gctccctggt cgggcagaaa cttaaaagcc gctctgctgt 21306 

cgttgtctac gtggaacttc tccattaaca tcatcatggt ttccataccc ttctcccacg 21366 

ctgtcaccag tggtttgctg tcggggttct tcaccaacac ggcggtagag gggccctcgc 21426 

cggccccgac gtccttcatg gtcattcttt gaaactccac ggagccgtcc gcgcgacgta 21486 

ctctgcgcac cggagggtag ctgaagccca cctccaccac ggtgccttcg ccctcgctgt 21546 

cggagacaat ctccggggat ggcggcggcg cgggtgtcgc cttgcgagcc ttcttcttgg 21606 

gagggagctg aggcgcctcc tgctcgcgct cggggctcat ctcccgcaag tagggggtaa 21666 

tggagctgcc tgcttggttc tgacggttgg ccattgtatc ctaggcagaa agacatggag 21726 

cttatgcgcg aggaaacttt aaccgccccg tcccccgtca gcgacgaaga tgtcatcgtc 21786 

gaacaggacc cgggctacgt tacgccgccc gaggatctgg aggggcctga ccggcgcgac 21846 

gctagtgagc ggcaggaaaa tgagaaagag gaggcctgct acctcctgga aggcgacgtt 21906 

ttgctaaagc atttcgccag gcagagcacc atagttaagg aggccttgca agaccgctcc 21966 

gaggtgccct tggacgtcgc cgcgctctcc caggcctacg aggcgaacct tttctcgcct 22026

cgagtgcctc cgaagagaca gcccaacggc acctgcgagc ccaacccgcg actcaacttc 22086

taccccgtgt tcgccgtacc agaggcgctg gccacctatc acattttttt caaaaaccaa 22146

cgcatccccc tatcgtgccg ggccaaccgc accgcggccg ataggaatct caggcttaaa 22206

aacggagcca acatacctga tatcacgtcg ctggaggaag tgcccaagat tttcgagggt 22266

ctgggtcgag atgagaagcg ggcggcgaac gctctgcaga aagaacagaa agagagtcag 22326

aacgtgctgg tggagctgga gggggacaac gcgcgtctgg ccgtcctcaa acgctgcata 22386

gaagtctccc acttcgccta ccccgccctc aacttgccac ccaaagttat gaaatcggtc 22446

atggatcagc tgctcatcaa gagagctgag cccctggatc ccgaccaccc cgaggcggaa 22506

aactcagagg acggaaagcc cgtcgtcagc gacgaggagc tcgagcggtg gctggaaacc 22566

agggaccccc aacagttgca agagaggcgc aagatgatga tggcggccgt gctggtcacc 22626

gtggagctgg aatgcctgca acggtttttc agcgacgtgg agacgctacg caaaatcggg 22686

gaatccctgc actacacctt ccgccagggc tacgtccgcc aggcctgcaa gatctccaac 22746

gtggagctca gcaacctggt ctcctacatg ggcatcctcc acgagaaccg gctggggcag 22806

agcgtgctgc actgcacctt gcaaggcgag gcgcggcggg actacgtgcg agactgcatc 22866

tacctcttcc tcaccctcac ctggcagacc gccatgggcg tctggcagca gtgcttggaa 22926

gagagaaacc tcaaagagct agacaaactc ctctgccgcc agcggcgcgc cctgtggtcc 22986

ggtttcagcg agcgcacggt cgccagcgct ctggcggaca tcatcttccc ggagcgcctg 23046

atgaaaacct tgcaaaacgg cctgccggat ttcatcagtc aaagcatttt gcaaaacttc 23106

cgctcttttg tcctggaacg ctccgggatc ttgcccgcca tgagctgcgc gctaccttct 23166

gactttgtcc ccctctccta ccgcgagtgc cctcccccac tgtggagcca ctgctacctc 23226

ttccaactgg ccaactttct ggcctaccac tccgacctca tggaagacgt aagcggagag 23286

ggtttactgg agtgccactg ccgctgcaac ctgtgcaccc cccacagatc gctggcctgc 23346

aacaccgagc tactcagcga aacccaggtc ataggtacct tcgagatcca ggggccccag 23406

cagcaagagg gtgcttccgg cttgaagctc actccggcgc tgtggacctc ggcttactta 23466

cgcaaatttg tagccgagga ctaccacgcc cacaaaattc agttttacga agaccaatct 23526

cgaccaccga aagcccccct cacggcctgc gtcatcaccc agagcaagat cctggcccaa 23586

ttgcaatcca tcaaccaagc gcgccgcgat ttccttttga aaaagggtcg gggggtgtac 23646

ctggaccccc agaccggcga ggaactcaac ccgtccacac tctccgtcga agcagccccc 23706

ccgagacatg ccgcccaagg gaaccgccaa gcagctgatc gctcggcaga gagcgaagaa 23766

gcaagagctg ctccagcagc aggtggagga cgaggaagag atgtgggaca gccaggcaga 23826


ggaggtgtca gaggacgagg aggagatgga aagctgggac agcctagacg aggaggagga 23886

cgagctttca gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct 23946

gagccgacag ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg 24006

taggtgggac gccaccgaat ctccagcggc agcggcaacg gcagcgggta aggccaaacg 24066

cgagcggcgg gggtattgct cctggcgggc ccacaaaagc agtattgtga actgcttgca 24126

acactgcggg ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc 24186

cttccctcgc aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg 24246

agaaaaaagc taaggcctcc tccgccgcga ggaaaaactc cgccgccgct gccgccgcca 24306

aggatccacc ggccaccgaa gagctgagaa agcgcatctt tcccactctg tatgctatct 24366

ttcagcaaag ccgcgggcag caccctcagc gcgaactgaa aataaaaaac cgctccttcc 24426

gctcgctcac ccgcagctgt ctgtaccaca agagagaaga ccagctgcag cgcaccctgg 24486

acgacgccga agcactgttc agcaaatact gctcagcgtc tcttaaagac taaaagaccc 24546

gcgctttttc cccctcggcc gccaaaaccc acgtcatcgc cagcatgagc aaggagattc 24606

ccacccccta catgtggagc tatcagcccc agatgggcct ggccgcgggg gccgcccagg 24666

actactccag caagatgaac tggctcagcg ccggccccca catgatctca cgagttaacg 24726

gcatccgagc ccaccgaaac cagattctct tagaacaggc ggcaatcacc gccacacccc 24786

ggcgccaact caacccgcct agttggcccg ccgcccaggt gtatcaggaa aatccccgcc 24846

cgaccacagt cctcctgcca cgcgacgcgg aggccgaagt cctcatgact aactctgggg 24906

tacaattagc gggcgggtcc aggtacgcca ggtacagagg tcgggccgct ccttactctc 24966

ccgggagtat aaagagggtg atcattcgag gccgaggtat ccagctcaac gacgagacgg 25026

tgagctcctc aaccggtctc agacctgacg gagtcttcca gctcggagga gcgggccgct 25086

cttccttcac cactcgccag gcctacctga ccctgcagag ctcttcctcg cagccgcgct 25146

ccgggggaat cggcactctc cagttcgtgg aagagttcgt tccctccgtc tacttcaacc 25206

ccttctccgg ctcgcctgga cgctacccgg acgccttcat tcccaacttt gacgcagtga 25266

gtgaatccgt ggacggctac gactgatgac agatggtgcg gccgtgagag ctcggctgcg 25326

acatctgcat cactgccgtc agcctcgctg ctacgctcgg gaggcgatcg tcttcagcta 25386

ctttgagctg ccggacgagc accctcaggg tccggctcac gggttgaaac tcgagatcga 25446

gaacgcgctc gagtctcgcc tcatcgacac cttcaccgcc cgacctctcc tggtagaaat 25506


ccaacggggg atcactacca tcaccctgtt ctgcatctgc cccacgcccg gattacatga 25566

agatctgtgt tgtcatcttt gcgctcagtt taataaaaac tgaacttttt gccgcacctt 25626

caacgccatc tgtgatttct acaacaaaaa gttcttctgg caaaggtaca caaactgtat 25686

tttattctaa ttctacctca tctatcgtgc tgaactgcgc ctgcactaac gaacttatcc 25746

agtggattgc aaacggtagt gtgtgcaagt acttttgggg gaacgatata gttagtagaa 25806

ataacagcct ttgcgagcac tgcaactcct ccacactaat cctttatccc ccatttgtta 25866

ctggatggta tatgtgcgtt ggctccggtt taaatcctag ttgctttcat aagtggtttc 25926

tacaaaaaga gacccttccc aacaattctg tttctttttt cgccctatcc tactgctgtt 25986

ctccctctgg ttactctttc aaacctctaa ttggtatttt agctttgata ctcataatct 26046

ttattaactt tataataatt aacaacttac agtaaacatg cttgttctac tgctcgccac 26106

atctttcgct ctctctcacg ccagaacaag tattgttggc gcaggttaca atgcaactct 26166

tcaatctgct tacatgccag attccgacca gataccccat attacgtggt acttacaaac 26226

ctccaaacct aattcttcat tttatgaagg aaacaaactc tgcgatgact ccgacaacag 26286

aacgcacaca tttccccacc cttcactaca attcgaatgc gtaaacaaaa gcttgaagct 26346

ttacaactta aagccttcag attctggctt gtaccatgct gtagttgaaa aaagtaattt 26406

agaagtccac agtgattaca ttgaattgac ggttgtggac ctgccacctc caaaatgtga 26466

ggtttcctcc tcttaccttg aagttcaagg cgtgqatcrcc tactgcctca tacacattaa 26526

ctgcagcaac tctaaatatc cagctagaat ttactataat ggacaggaaa gtaatctttt 26586

ttattattta acaacaagcg ctggtaacgg taaacagtta cctgactatt ttactgctgt 26646

tgttgaattt tccacctaca gagaaacgta tgccaagcgg ccttacaatt tctcataccc 26706

gtttaacgac ctttgcaatg aaatacaagc gctcgaaact ggaactgatt ttactccaat 26766

tttcattgct gccattgttg taagcttaat taccattatt gtcagcctag cattttactg 26826

cttttacaag cccaaaaacc ctaagtttga aaaacttaaa ctaaaacctg tcattcaaca 26886

agtgtgattt tgttttccag catggtagct gcatttctac ttctcctctg tctacccatc 26946

attttcgtct cttcaacttt cgccgcagtt tcccacctgg aaccagagtg cctaccgcct 27006

tttgacgtgt atctgattct cacctttgtt tgttgtatat ccatttgcag tatagcctgc 27066

ttttttataa caatctttca agccgccgac tatttttacg tgcgaattgc ttactttaga 27126

caccatcctg aatacagaaa tcaaaacgtt gcctccttac tttgtttggc atgattaagt 27186

tattgctgat acttaattat ttacccctaa tcaactgtaa ttgtccattc accaaaccct 27246


ggtcattcta cacctgttat gataaaatcc ccgacactcc tgttgcttgg ctttacgcag 27306

ccaccgccgc tttggtattt atatctactt gccttggagt aaaattgtat tttattttac 27366

acactgggtg gctacatccc agagaagatt tacctagata tcctcttgta aacgcttttc 27426

aattacagcc tctgcctcct cctgatcttc ttcctcgagc tccctctatt gtgagctact 27486

ttcaactcac cggtggagat gactgactct caggacatta atattagtgt ggaaagaata 27546

gctgctcagc gtcagcgaga aacgcgagtg ttggaatacc tggaactaca gcaacttaaa 27606

gagtcccact ggtgtgagaa aggagtgctg tgccatgtta agcaggcagc cctttcctac 27666

gatgtcagcg ttcagggaca tgaactgtct tacactttgc ctttgcagaa acaaaccttc 27726

tgcaccatga tgggctctac ctccatcaca atcacccaac aagccgggcc tgtagagggg 27786

gctatcctct gtcactgtca cgcacctgat tgcatgtcca aactaatcaa aactctctgt 27846

gctttaggtg atatttttaa ggtgtaaatc aataataaac ttaccttaaa tttgacaaca 27906

aatttctggt gacatcattc agcagcacca ctttaccctc ttcccagctc tcgtatggga 27966

tgcgatagtg ggtggcaaac ttcctccaaa ccctaaaaga aatattggta tccacttcct 28026

tgtcctcacc cacaattttc atcttttcat ag atg aaa aga acc aga gtt gat 28079
Met Lys Arg Thr Arg Val Asp
1435 1440

gaa gac ttc aac ccc gtc tac ccc tat gac acc aca acc act cct 28124 
Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Thr Thr Thr Thr Pro 
1445 1450 1455 

gca gtt ccc ttt ata tca ccc ccc ttt gta aac agc gat ggt ctt 28169
Ala Val Pro Phe Ile Ser Pro Pro Phe Val Asn Ser Asp Gly Leu
1460 1465 1470

cag gaa aac ccc cca ggt gtt tta agt ctg cga ata gct aaa ccc 28214
Gln Glu Asn Pro Pro Gly Val Leu Ser Leu Arg Ile Ala Lys Pro
1475 1480 1485

cta tat ttc gac atg gag aga aaa cta gcc ctt tca ctt gga aga 28259
Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser Leu Gly Arg
1490 1495 1500

ggg ttg aca att acc gcc gcc gga caa tta gaa agt acg cag agc 28304
Gly Leu Thr Ile Thr Ala Ala Gly Gln Leu Glu Ser Thr Gln Ser
1505 1510 1515

gta caa acc aac cca ccg ttg ata att acc aac aac aac aca ctg 28349
Val Gln Thr Asn Pro Pro Leu Ile Ile Thr Asn Asn Asn Thr Leu
1520 1525 1530

acc cta cgt cat tct ccc ccc tta aac cta act gac aat agc tta 28394
Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu
1535 1540 1545

gtg cta ggc tac tcg agt cct ctc cgc gtc aca gac aac aaa ctt 28439
Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu
1550 1555 1560

aca ttt aac ttc aca tca cca ctc cgt tat gaa aat gaa aac ctt 28484
Thr Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu
1565 1570 1575

act ttt aac tat aca gag cct ctt aaa ctt ata aat aac agc ctt 28529
Thr Phe Asn Tyr Thr Glu Pro Leu Lys Leu Ile Asn Asn Ser Leu
1580 1585 1590

gcc att gac atc aat tcc tca aaa ggc ctt agt agc gtc gga ggc 28574
Ala Ile Asp Ile Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly
1595 1600 1605

tca cta gct gta aac ctg agt tca gac tta aag ttt gac agc aac 28619
Ser Leu Ala Val Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn
1610 1615 1620

gga tcc ata gct ttt ggc ata caa acc ctg tgg acc gct ccg acc 28664
Gly Ser Ile Ala Phe Gly Ile Gln Thr Leu Trp Thr Ala Pro Thr
1625 1630 1635

tcg act ggc aac tgc acc gtc tac agc gag ggc gat tcc cta ctt 28709
Ser Thr Gly Asn Cys Thr Val Tyr Ser Glu Gly Asp Ser Leu Leu
1640 1645 1650

agt ctc tgt tta acc aaa tgc gga gct cac gtc tta gga agt gta 28754
Ser Leu Cys Leu Thr Lys Cys Gly Ala His Val Leu Gly Ser Val
1655 1660 1665

agt tta acc ggt tta aca gga acc ata acc caa atg act gat att 28799
Ser Leu Thr Gly Leu Thr Gly Thr Ile Thr Gln Met Thr Asp Ile
1670 1675 1680

tct gtc acc att caa ttt aca ttt gac aac aat ggt aag cta cta 28844
Ser Val Thr Ile Gln Phe Thr Phe Asp Asn Asn Gly Lys Leu Leu
1685 1690 1695

agc tct cca ctt ata aac aac gcc ttt agt att cga cag aat gac 28889
Ser Ser Pro Leu Ile Asn Asn Ala Phe Ser Ile Arg Gln Asn Asp
1700 1705 1710

agt acg gcc tca aac cct acc tac aac gcc ctg gcg ttt atg cct 28934
Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu Ala Phe Met Pro
1715 1720 1725

aac agt acc ata tat gca aga ggg gga ggt ggt gaa cca cga aac 28979
Asn Ser Thr Ile Tyr Ala Arg Gly Gly Gly Gly Glu Pro Arg Asn
1730 1735 1740

aac tac tac gtc caa acg tat ctt agg gga aat gtt caa aaa cca 29024
Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Gln Lys Pro
1745 1750 1755

atc att ctt act gta acc tac aac tca gtc gcc aca gga tat tcc 29069
Ile Ile Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr Ser
1760 1765 1770

tta tct ttt aag tgg act gct ctt gca cgt gaa aag ttt gca acc 29114
Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr
1775 1780 1785

cca aca acc tcg ttt tgc tac att aca gaa caa taa aaccgtgtac 29160
Pro Thr Thr Ser Phe Cys Tyr Ile Thr Glu Gln
1790 1795

cccaccgttt cgtttttttc ag atg aaa cgg gcg aga gtt gat gaa gac 29209
Met Lys Arg Ala Arg Val Asp Glu Asp
1800 1805

ttc aac cca gtg tac cct tat gac ccc cca cat gct cct gtt atg 29254
Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro His Ala Pro Val Met
1810 1815 1820

ccc ttc att act cca cct ttt acc tcc tcg gat ggg ttg cag gaa 29299
Pro Phe Ile Thr Pro Pro Phe Thr Ser Ser Asp Gly Leu Gln Glu
1825 1830 1835

aaa cca ctt gga gtg tta agt tta aac tac aga gat ccc att act 29344
Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp Pro Ile Thr
1840 1845 1850

acg caa aat gag tct ctt aca att aaa cta gga aac ggc ctc act 29389
Thr Gln Asn Glu Ser Leu Thr Ile Lys Leu Gly Asn Gly Leu Thr
1855 1860 1865

cta gac aac cag gga caa cta aca tca acc gct ggc gaa gta gaa 29434
Leu Asp Asn Gln Gly Gln Leu Thr Ser Thr Ala Gly Glu Val Glu
1870 1875 1880

cct cca ctc act aac gct aac aac aaa ctt gca ctg gtc tat agc 29479
Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu Val Tyr Ser
1885 1890 1895

gat cct tta gca gta aag cgc aac agc cta acc tta tcg cac acc 29524
Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser His Thr
1900 1905 1910

gct ccc ctt gtt att gct gat aac tct tta gca ttg caa gtt tca 29569
Ala Pro Leu Val Ile Ala Asp Asn Ser Leu Ala Leu Gln Val Ser
1915 1920 1925

gag cct att ttt ata aat gac aag gac aaa cta gcc ctg caa aca 29614
Glu Pro Ile Phe Ile Asn Asp Lys Asp Lys Leu Ala Leu Gln Thr
1930 1935 1940

gcc gcg ccc ctt gta act aac gct ggc acc ctt cgc tta caa agc 29659
Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gln Ser
1945 1950 1955

gcc gcc cct tta ggc att gca gac caa acc cta aaa ctc ctg ttt 29704
Ala Ala Pro Leu Gly Ile Ala Asp Gln Thr Leu Lys Leu Leu Phe
1960 1965 1970

acc aac cct ttg tac ttg cag aat aac ttt ctc acg tta gcc att 29749
Thr Asn Pro Leu Tyr Leu Gln Asn Asn Phe Leu Thr Leu Ala Ile
1975 1980 1985

gaa cga ccc ctt gcc att acc aat act gga aag ctg gct cta cag 29794
Glu Arg Pro Leu Ala Ile Thr Asn Thr Gly Lys Leu Ala Leu Gln
1990 1995 2000

ctc tcc cca ccg cta caa aca gca gac aca ggc ttg act ttg caa 29839
Leu Ser Pro Pro Leu Gln Thr Ala Asp Thr Gly Leu Thr Leu Gln
2005 2010 2015

acc aac gtg cca tta act gta agc aac ggg acc cta ggc tta gcc 29884
Thr Asn Val Pro Leu Thr Val Ser Asn Gly Thr Leu Gly Leu Ala
2020 2025 2030

ata aag cgc cca ctt att att cag gac aac aac ttg ttt ttg gac 29929
Ile Lys Arg Pro Leu Ile Ile Gln Asp Asn Asn Leu Phe Leu Asp
2035 2040 2045

ttc aga gct ccc ctg cgt ctt ttc aac agc gac cca gta cta ggg 29974
Phe Arg Ala Pro Leu Arg Leu Phe Asn Ser Asp Pro Val Leu Gly
2050 2055 2060

ctt aac ttt tac acc cct ctt gcg gta cgc gat gag gcg ctc act 30019
Leu Asn Phe Tyr Thr Pro Leu Ala Val Arg Asp Glu Ala Leu Thr
2065 2070 2075

gtt aac aca ggc cgc ggc ctc aca gtg agt tac gat ggt tta att 30064
Val Asn Thr Gly Arg Gly Leu Thr Val Ser Tyr Asp Gly Leu Ile
2080 2085 2090

tta aat ctt ggt aag gat ctt cgc ttt gac aac aac acc gtt tct 30109
Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp Asn Asn Thr Val Ser
2095 2100 2105

gtc gct ctt agt gct gct ttg cct tta caa tac act gat cag ctt 30154
Val Ala Leu Ser Ala Ala Leu Pro Leu Gln Tyr Thr Asp Gln Leu
2110 2115 2120

cgc ctt aac gtg ggc gct ggg ctg cgt tac aat cca gtg agt aag 30199
Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro Val Ser Lys
2125 2130 2135

aaa ttg gac gtg aac ccc aat caa aac aag ggt tta acc tgg gaa 30244
Lys Leu Asp Val Asn Pro Asn Gln Asn Lys Gly Leu Thr Trp Glu
2140 2145 2150

aat gac tac ctc att gta aag cta gga aat gga tta ggt ttt gat 30289
Asn Asp Tyr Leu Ile Val Lys Leu Gly Asn Gly Leu Gly Phe Asp
2155 2160 2165

ggc gat gga aac ata gct gtt tct cct caa gtt aca tcg cct gac 30334
Gly Asp Gly Asn Ile Ala Val Ser Pro Gln Val Thr Ser Pro Asp
2170 2175 2180

acc tta tgg acc act gcc gac cca tcc ccc aat tgt tcc atc tac 30379
Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Ile Tyr
2185 2190 2195

act gat tta gat gcc aaa atg tgg ctc tcg ttg gta aaa caa ggg 30424
Thr Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gln Gly
2200 2205 2210

ggt gtg gtt cac ggt tct gtt gct tta aaa gca ttg aaa gga acc 30469
Gly Val Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr
2215 2220 2225

cta ttg agt cct acg gaa agc gcc att gtt att ata cta cat ttt 30514
Leu Leu Ser Pro Thr Glu Ser Ala Ile Val Ile Ile Leu His Phe
2230 2235 2240

gac aat tat gga gtg cga att ctc aat tat ccc act ttg ggc act 30559
Asp Asn Tyr Gly Val Arg Ile Leu Asn Tyr Pro Thr Leu Gly Thr
2245 2250 2255

caa ggc acg ttg gga aat aat gca act tgg ggt tat agg cag gga 30604
Gln Gly Thr Leu Gly Asn Asn Ala Thr Trp Gly Tyr Arg Gln Gly
2260 2265 2270

gaa tct gca gac act aat gta ctc aat gca cta gca ttt atg ccc 30649
Glu Ser Ala Asp Thr Asn Val Leu Asn Ala Leu Ala Phe Met Pro
2275 2280 2285

agt tca aaa agg tac cca aga ggg cgt gga agc gaa gtt cag aat 30694
Ser Ser Lys Arg Tyr Pro Arg Gly Arg Gly Ser Glu Val Gln Asn
2290 2295 2300

caa act gtg ggc tac act tgt ata cag ggt gac ttt tct atg ccc 30739
Gln Thr Val Gly Tyr Thr Cys Ile Gln Gly Asp Phe Ser Met Pro
2305 2310 2315

gta ccg tac caa ata cag tac aac tat gga cca act ggc tac tcc 30784
Val Pro Tyr Gln Ile Gln Tyr Asn Tyr Gly Pro Thr Gly Tyr Ser
2320 2325 2330

ttt aaa ttt att tgg aga act gtt tca aga caa cca ttt gac atc 30829
Phe Lys Phe Ile Trp Arg Thr Val Ser Arg Gln Pro Phe Asp Ile
2335 2340 2345

cca tgc tgt ttt ttc tct tac att acg gaa gaa taa aacaactttt 30875
Pro Cys Cys Phe Phe Ser Tyr Ile Thr Glu Glu
2350 2355

tctttttatt ttctttttat tttacacgca cagtaaggct tcctccaccc ttccatctca 30935 

cagcatacac cagcctctcc cccttcatgg cagtaaactg ttgtgagtca gtccggtatt 30995 

tgggagttaa gatccaaaca gtctctttgg tgatgaaaca tggatccgtg atggacacaa 31055 

atccctggga caggttctcc aacgtttcgg taaaaaactg catgccgccc tacaaaacaa 31115 

acaggttcag gctctccacg ggttatctcc ccgatcaaac tcagacagag taaaggtgcg 31175 

atgatgttcc actaaaccac gcaggtggcg ctgtctgaac ctctcggtgc gactcctgtg 31235 

aggctggtaa gaagttagat tgtccagcag cctcacagca tggatcatca gtctacgagt 31295 

gcgtctggcg cagcagcgca tctgaatctc actgagattc cggcaagaat cgcacaccat 31355 

cacaatcagg ttgttcatga tcccatagct gaacacgctc cagccaaagc tcattcgctc 31415 

caacagcgcc accgcgtgtc cgtccaacct tactttaaca taaatcaggt gtctgccgcg 31475 

tacaaacatg ctacccgcat acagaacctc ccggggcaaa cccctgttca ccacctgcct 31535 

gtaccaggga aacctcacat ttatcaggga gccatagata gccattttaa accaattagc 31595 

taacaccgcc ccaccagctc tacactgaag agaaccggga gagttacaat gacagtgaat 31655 

aatccatctc tcataacccc taatggtctg atggaaatcc agatctaacg tggcacagca 31715 

gatacacact ttcatataca ttttcatcac atgtttttcc caggccgtta aaatacaatc 31775 

ccaatacacg ggccactcct gcagtacaat aaagctaata caagatggta tactcctcac 31835 

ctcactaaca ttgtgcatgt tcatattttc acattctaag taccgagagt tctcctctac 31895 

aacagcactg ccgcggtcct cacaaggtgg tagctggtga cgattgtaag gagccagtct 31955 

gcagcgatac cgtctgtcgc gttgcatcgt agaccaggga ccgacgcact tcctcgtact 32015 

tgtagtagca gaaccacgtc cgctgccagc acgtctccaa gtaacgccgg tccctgcgtc 32075 

gctcacgctc cctcctcaac gcaaagtgca accactcttg taatccacac agatccctct 32135 

cggcctccgg ggcgatgcac acctcaaacc tacagatgtc tcggtacagt tccaaacacg 32195 

tagtgagggc gagttccaac caagacagac agcctgatct atcccgacac actggaggtg 32255 

gaggaagaca cggaagaggc atgttattcc aagcgattca ccaacgggtc gaaatgaaga 32315 

tcccgaagat gacaacggtc gcctccggag ccctgatgga atttaacagc cagatcaaac 32375 

attatgcgat tttccaggct atcaatcgcg gcctccaaaa gagcctggac ccgcacttcc 32435 

acaaacacca gcaaagcaaa agcgttatta tcaaactctt cgatcatcaa gctgcaggac 32495 

tgtacaatgc ccaagtaatt ttcatttctc cactcgcgaa tgatgtcgcg gcaaatagtc 32555 

tgaaggttca tgccgtgcat attaaaaagc tccgaaaggg cgccctctat agccatgcgt 32615 

agacacacca tcatgactgc aagatatcgg gctcctgaga cacctgcagc agatttaaca 32675 

gacccaggtc aggttgctct ccgcgatcgc gaatctccat ccgcaaagtc atttgcaaat 32735 

aattaaatag atctgcgccg actaaatctg ttaactccgc gctaggaact aaatcaggtg 32795 

tggctacgca gcacaaaagt tccagggatg gcgccaaact cactagaacc gctcccgagt 32855 

agcaaaactg atgaatggga gtaacacagt gtaaaatgtt cagccaaaaa tcactaagct 32915 

gctcctttaa aaagtccagt acttctatat tcagttcgtg caagtactga agcaactgtg 32975 

cgggaatatg cacagcaaaa aaaatagggc ggctcagata catgttgacc taaaataaaa 33035 

agaatcatta aactaaagaa gcctggcgaa cggtgggata tatgacacgc tccagcagca 33095 

ggcaagcaac cggctgtccc cgggaaccgc ggtaaaattc atccgaatga ttaaaaagaa 33155 

caacagagac ttcccaccat gtactcggtt ggatctcctg agcacagagc aatacccccc 33215 

tcacattcat atccgctaca gaaaaaaaac gtcccagata cccagcggga atatccaacg 33275 

acagctgcaa agacagcaaa acaatccctc tgggagcaat cacaaaatcc tccggtgaaa 33335 

aaagcacata catattagaa taaccctgtt gctggggcaa aaaggcccgt cgtcccagca 33395 

aatgcacata aatatgttca tcagccattg ccccgtctta ccgcgtaaac agccacgaaa 33455 

aaatcgagct aaaatccacc caacagccta tagctatata tacactccac ccaatgacgc 33515 

taataccgca ccacccacga ccaaagttca cccacaccca caaaacccgc gaaaatccag 33575 

cgccgtcagc acttccgcaa tttcagtctc acaacgtcac ttccgcgcgc cttttcactt 33635 

tcccacacac gcccttcgcc cgcccgccct cgcgccaccc cgcgtcaccc cacgtcaccg 33695 

cacgtcaccc cggccccgcc tcgctcctcc ccgctcatta tcatattggc acgtttccag 33755 

aataaggtat attattgatg cagcaaaaca atccctctgg gagcaatcac aaaatcctcc 33815 

ggtgaaaaaa gcacatacat attagaataa ccctgttgct ggggcaaaaa ggcccgtcgt 33875 

cccagcaaat gcacataaat atgttcatca gccattgccc cgtcttaccg cgtaaacagc 33935 

cacgaaaaaa tcgagctaaa atccacccaa cagcctatag ctatatatac actccaccca 33995 

atgacgctaa taccgcacca cccacgacca aagttcaccc acacccacaa aacccgcgaa 34055 

aatccagcgc cgtcagcact tccgcaattt cagtctcaca acgtcacttc cgcgcgcctt 34115 

ttcactttcc cacacacgcc cttcgcccgc ccgccctcgc gccaccccgc gtcaccccac 34175 

gtcaccgcac gtcaccccgg ccccgcctcg ctcctccccg ctcattatca tattggcacg 34235 

tttccagaat aaggtatatt attgatgca 34264 

<210> 25 

<211> 503 

<212> PRT 

<213> simian adenovirus SV-1 

<400> 25 

Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro 
1 5 10 15 

Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu 
20 25 30 

Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg 
35 40 45 

Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys 
50 55 60 

Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr 
65 70 75 80 

Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp 
85 90 95 

Phe Thr Pro Thr Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu Arg 
100 105 110 

Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Leu His Thr Asn Met Pro 
115 120 125 

Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu Met 
130 135 140 

Val Lys Lys Ala Glu Asn Gln Pro Pro Glu Tyr Glu Trp Phe Glu Phe 
145 150 155 160 

Thr Ile Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp Leu Met 
165 170 175 

Asn Asn Ala Ile Val Asp Asn Tyr Leu Gln Val Gly Arg Gln Asn Gly 
180 185 190 

Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg 
195 200 205 

Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr 
210 215 220 

Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val 
225 230 235 240 

Asp Phe Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg 
245 250 255 

Arg Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly 
260 265 270 

Gly Asn Ile Pro Gly Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser Val 
275 280 285 

Lys Gln Ala Glu Ala Gln Gly Arg Glu Ile Arg Gly Asp Thr Phe Ala 
290 295 300 

Thr Glu Pro His Glu Leu Val Ile Lys Pro Leu Glu Gln Asp Ser Lys 
305 310 315 320 

Lys Arg Ser Tyr Asn Ile Ile Ser Gly Thr Met Asn Thr Leu Tyr Arg 
325 330 335 

Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
340 345 350 

Ser Trp Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gln Gln 
355 360 365 

Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg 
370 375 380 

Pro Ser Thr Gln Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu 
385 390 395 400 

Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr Ser Gln 
405 410 415 

Leu Ile Arg Gln Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 
420 425 430 

Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val 
435 440 445 

Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg 
450 455 460 

Ser Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr Asp Ala Arg Arg 
465 470 475 480 

Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys 
485 490 495 

Val Leu Ser Ser Arg Thr Phe 
500 



<210> 26 

<211> 931 

<212> PRT 

<213> simian adenovirus SV-1 

<400> 26 



Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala 
1 5 10 15 

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala 
20 25 30 

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu 
50 55 60 

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Pro Ala Glu Trp Thr Asn Ser Asp Ser Lys Val Lys Val 
130 135 140 

Arg Ala Gln Ala Pro Phe Val Ser Ser Tyr Gly Ala Thr Ala Ile Thr 
145 150 155 160 

Lys Glu Gly Ile Gln Val Gly Val Thr Leu Thr Asp Ser Gly Ser Thr 
165 170 175 

Pro Gln Tyr Ala Asp Lys Thr Tyr Gln Pro Glu Pro Gln Ile Gly Glu 
180 185 190 

Leu Gln Trp Asn Ser Asp Val Gly Thr Asp Asp Lys Ile Ala Gly Arg 
195 200 205 

Val Leu Lys Lys Thr Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala 
210 215 220 

Arg Pro Thr Asn Glu Lys Gly Gly Gln Ala Thr Pro Ser Ala Ser Gln 
225 230 235 240 

Asp Val Gln Asn Pro Glu Leu Gln Phe Phe Ala Ser Thr Asn Val Ala 
245 250 255 

Asn Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp Val Ser Ile Glu Ala 
260 265 270 

Pro Asp Thr His Leu Val Phe Lys Pro Thr Val Thr Glu Gly Ile Thr 
275 280 285 

Ser Ser Glu Ala Leu Leu Thr Gln Gln Ala Ala Pro Asn Arg Pro Asn 
290 295 300 

Tyr Ile Ala Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser 
305 310 315 320 

Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala 
325 330 335 

Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met 
340 345 350 

Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gln 
355 360 365 

Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly 
370 375 380 

Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Ala 
385 390 395 400 

Val Thr Asp Thr Tyr Ser Pro Ile Lys Val Asn Gly Gly Gly Asn Gly 
405 410 415 

Trp Glu Ala Asn Asn Gly Val Phe Thr Glu Arg Gly Val Glu Ile Gly 
420 425 430 

Ser Gly Asn Met Phe Ala Met Glu Ile Asn Leu Gln Ala Asn Leu Trp 
435 440 445 

Arg Ser Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser Leu 
450 455 460 

Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr Tyr 
465 470 475 480 

Gln Tyr Met Asn Gly Arg Val Thr Pro Pro Gly Leu Val Asp Thr Tyr 
485 490 495 

Val Asn Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser Ile Asn 
500 505 510 

Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 
515 520 525 

Leu Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys 
530 535 540 

Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr Tyr 
545 550 555 560 

Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser Ser Leu 
565 570 575 

Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Ile Arg Phe Asp Ser Ile 
580 585 590 

Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr Ala Ser Thr 
595 600 605 

Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser Phe Asn Asp 
610 615 620 

Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala Asn Ala Thr 
625 630 635 640 

Ser Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala Phe Arg Gly 
645 650 655 

Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu Gly Ser 
660 665 670 

Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Ser Ile Pro Tyr Leu Asp 
675 680 685 

Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val Ser Ile Met Phe 
690 695 700 

Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn 
705 710 715 720 

Glu Phe Glu Ile Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala 
725 730 735 

Gln Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His 
740 745 750 

Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp 
755 760 765 

Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Ile 
770 775 780 

Val Asp Ser Thr Ala Tyr Thr Asn Tyr Gln Asp Val Lys Leu Pro Tyr 
785 790 795 800 

Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr Met Arg 
805 810 815 

Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile Gly Ala 
820 825 830 

Thr Ala Val Pro Ser Leu Thr Gln Lys Lys Phe Leu Cys Asp Arg Val 
835 840 845 

Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly Ser Leu 
850 855 860 

Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His Ala Leu 
865 870 875 880 

Asp Met Thr Phe Glu Val Asp Pro Met Asp Glu Pro Thr Leu Leu Tyr 
885 890 895 

Val Leu Phe Glu Val Phe Asp Val Val Arg Ile His Gln Pro His Arg 
900 905 910 

Gly Val Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala Gly Asn 
915 920 925 

Ala Thr Thr 
930 



<210> 27 
<211> 363 
<212> PRT 

<213> simian adenovirus SV-1 
<400> 27 

Met Lys Arg Thr Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 15 

Asp Thr Thr Thr Thr Pro Ala Val Pro Phe Ile Ser Pro Pro Phe Val 
20 25 30 

Asn Ser Asp Gly Leu Gln Glu Asn Pro Pro Gly Val Leu Ser Leu Arg 
35 40 45 

Ile Ala Lys Pro Leu Tyr Phe Asp Met Glu Arg Lys Leu Ala Leu Ser 
50 55 60 

Leu Gly Arg Gly Leu Thr Ile Thr Ala Ala Gly Gln Leu Glu Ser Thr 
65 70 75 80 

Gln Ser Val Gln Thr Asn Pro Pro Leu Ile Ile Thr Asn Asn Asn Thr 
85 90 95 

Leu Thr Leu Arg His Ser Pro Pro Leu Asn Leu Thr Asp Asn Ser Leu 
100 105 110 

Val Leu Gly Tyr Ser Ser Pro Leu Arg Val Thr Asp Asn Lys Leu Thr 
115 120 125 

Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Glu Asn Leu Thr Phe 
130 135 140 

Asn Tyr Thr Glu Pro Leu Lys Leu Ile Asn Asn Ser Leu Ala Ile Asp 
145 150 155 160 

Ile Asn Ser Ser Lys Gly Leu Ser Ser Val Gly Gly Ser Leu Ala Val 
165 170 175 

Asn Leu Ser Ser Asp Leu Lys Phe Asp Ser Asn Gly Ser Ile Ala Phe 
180 185 190 

Gly Ile Gln Thr Leu Trp Thr Ala Pro Thr Ser Thr Gly Asn Cys Thr 
195 200 205 

Val Tyr Ser Glu Gly Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys 
210 215 220 

Gly Ala His Val Leu Gly Ser Val Ser Leu Thr Gly Leu Thr Gly Thr 
225 230 235 240 

Ile Thr Gln Met Thr Asp Ile Ser Val Thr Ile Gln Phe Thr Phe Asp 
245 250 255 

Asn Asn Gly Lys Leu Leu Ser Ser Pro Leu Ile Asn Asn Ala Phe Ser 
260 265 270 

Ile Arg Gln Asn Asp Ser Thr Ala Ser Asn Pro Thr Tyr Asn Ala Leu 
275 280 285 

Ala Phe Met Pro Asn Ser Thr Ile Tyr Ala Arg Gly Gly Gly Gly Glu 
290 295 300 

Pro Arg Asn Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Gln 
305 310 315 320 

Lys Pro Ile Ile Leu Thr Val Thr Tyr Asn Ser Val Ala Thr Gly Tyr 
325 330 335 

Ser Leu Ser Phe Lys Trp Thr Ala Leu Ala Arg Glu Lys Phe Ala Thr 
340 345 350 

Pro Thr Thr Ser Phe Cys Tyr Ile Thr Glu Gln 
355 360 

<210> 28 

<211> 560 

<212> PRT 

<213> simian adenovirus SV-1 

<400> 28 

Met Lys Arg Ala Arg Val Asp Glu 
1 5 10 15 

Asp Pro Pro His Ala Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr 
20 25 30 

Ser Ser Asp Gly Leu Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Asn 
35 40 45 

Tyr Arg Asp Pro Ile Thr Thr Gln Asn Glu Ser Leu Thr Ile Lys Leu 
50 55 60 

Gly Asn Gly Leu Thr Leu Asp Asn Gln Gly Gln Leu Thr Ser Thr Ala 
65 70 75 80 

Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu 
85 90 95 

Val Tyr Ser Asp Pro Leu Ala Val Lys Arg Asn Ser Leu Thr Leu Ser 
100 105 110 

His Thr Ala Pro Leu Val Ile Ala Asp Asn Ser Leu Ala Leu Gln Val 
115 120 125 

Ser Glu Pro Ile Phe Ile Asn Asp Lys Asp Lys Leu Ala Leu Gln Thr 
130 135 140 

Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gln Ser Ala 
145 150 155 160 

Ala Pro Leu Gly Ile Ala Asp Gln Thr Leu Lys Leu Leu Phe Thr Asn 
165 170 175 

Pro Leu Tyr Leu Gln Asn Asn Phe Leu Thr Leu Ala Ile Glu Arg Pro 
180 185 190 

Leu Ala Ile Thr Asn Thr Gly Lys Leu Ala Leu Gln Leu Ser Pro Pro 
195 200 205 

Leu Gln Thr Ala Asp Thr Gly Leu Thr Leu Gln Thr Asn Val Pro Leu 
210 215 220 

Thr Val Ser Asn Gly Thr Leu Gly Leu Ala Ile Lys Arg Pro Leu Ile 
225 230 235 240 

Ile Gln Asp Asn Asn Leu Phe Leu Asp Phe Arg Ala Pro Leu Arg Leu 
245 250 255 

Phe Asn Ser Asp Pro Val Leu Gly Leu Asn Phe Tyr Thr Pro Leu Ala 
260 265 270 

Val Arg Asp Glu Ala Leu Thr Val Asn Thr Gly Arg Gly Leu Thr Val 
275 280 285 

Ser Tyr Asp Gly Leu Ile Leu Asn Leu Gly Lys Asp Leu Arg Phe Asp 
290 295 300 

Asn Asn Thr Val Ser Val Ala Leu Ser Ala Ala Leu Pro Leu Gln Tyr 
305 310 315 320 

Thr Asp Gln Leu Arg Leu Asn Val Gly Ala Gly Leu Arg Tyr Asn Pro 
325 330 335 

Val Ser Lys Lys Leu Asp Val Asn Pro Asn Gln Asn Lys Gly Leu Thr 
340 345 350 

Trp Glu Asn Asp Tyr Leu Ile Val Lys Leu Gly Asn Gly Leu Gly Phe 
355 360 365 

Asp Gly Asp Gly Asn Ile Ala Val Ser Pro Gln Val Thr Ser Pro Asp 
370 375 380 

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Ile Tyr Thr 
385 390 395 400 

Asp Leu Asp Ala Lys Met Trp Leu Ser Leu Val Lys Gln Gly Gly Val 
405 410 415 

Val His Gly Ser Val Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Ser 
420 425 430 

Pro Thr Glu Ser Ala Ile Val Ile Ile Leu His Phe Asp Asn Tyr Gly 
435 440 445 

Val Arg Ile Leu Asn Tyr Pro Thr Leu Gly Thr Gln Gly Thr Leu Gly 
450 455 460 

Asn Asn Ala Thr Trp Gly Tyr Arg Gln Gly Glu Ser Ala Asp Thr Asn 
465 470 475 480 

Val Leu Asn Ala Leu Ala Phe Met Pro Ser Ser Lys Arg Tyr Pro Arg 
485 490 495 

Gly Arg Gly Ser Glu Val Gln Asn Gln Thr Val Gly Tyr Thr Cys Ile 
500 505 510 

Gln Gly Asp Phe Ser Met Pro Val Pro Tyr Gln Ile Gln Tyr Asn Tyr 
515 520 525 

Gly Pro Thr Gly Tyr Ser Phe Lys Phe Ile Trp Arg Thr Val Ser Arg 
530 535 540 

Gln Pro Phe Asp Ile Pro Cys Cys Phe Phe Ser Tyr Ile Thr Glu Glu 
545 550 555 560 



<210> 29 

<211> 31044 

<212> DNA 

<213> simian adenovirus SV-25 
<220> 

<221> CDS 

<222> (12284)..(13801) 

<223> Penton 

<220> 

<221> CDS 

<222> (16681)..(19446) 

<223> Hexon 

<220> 

<221> CDS 

<222> (25380)..(26423) 

<223> Fiber #2 

<220> 

<221> CDS 

<222> (26457)..(28136) 

<223> Fiber #1 

<400> 29 

catcatcaat aatatacctt attctggaaa cgtgccaata tgataatgag cggggaggag 60 

cgaggcgggg ccggggtgac gtgcggtgac gcggggtggc gcgagggcgg ggcgaagggc 120 

gcgggtgtgt gtgtgggagg cgcttagttt ttacgtatgc ggaaggaggt tttataccgg 180 

aagatgggta atttgggcgt atacttgtaa gttttgtgta atttggcgcg aaaactgggt 240 

aatgaggaag ttgaggttaa tatgtacttt ttatgactgg gcggaatttc tgctgatcag 300 

cagtgaactt tgggcgctga cggggaggtt tcgctacgtg acagtaccac gagaaggctc 360 

aaaggtccca tttattgtac tcttcagcgt tttcgctggg tatttaaacg ctgtcagatc 420 

atcaagaggc cactcttgag tgctggcgag aagagttttc tcctccgtgc tgccacgatg 480 

aggctggtcc ccgagatgta cggtgttttt agcgacgaga cggtgcgtaa ctcagatgac 540 

ctgctgaatt cagacgcgct ggaaatttcc aattcgcctg tgctttcgcc gccgtcactt 600 

cacgacctgt ttgtgttttg gctcaacgct tagcaacgtg ttatataggg tcaagaagga 660 

gcaggagacg cagtttgcta ggctgttggc cgatactcct ggagtttttg tggctctgga 720 

tctaggccat cactctcttt tccaagagaa aattatcaaa aacttaactt ttacgtctcc 780 

tggtcgcacg gttgcttccg ctgcctttat tacctatatt ttggatcaat ggagcaacag 840 

cgacagccac ctgtcgtggg agtacatgct ggattacatg tcgatggcgc tgtggagggc 900 

catgctgcgg aggagggttt gcatttactt gcgggcgcag cctccgcggc tggaccgagt 960 

ggaggaggag gacgagccgg gggagaccga gaacctgagg gccgggctgg accctccaac 1020 

ggaggactag gtgctgagga tgatcccgaa gaggggacta gtggggctag gaagaagcaa 1080 

aagactgagt ctgaacctcg aaactttttg aatgagttga ctgtgagttt gatgaatcgt 1140 

cagcgtccgg agacaatttt ctggtctgaa ttggaggagg aattcaggag gggggaactg 1200 

aacctgctat acaagtatgg gtttgaacag ttaaaaactc actggttgga gccgtgggag 1260 

gattttgaaa ccgccttgga cacttttgct aaagtggctc tgcggccgga taaggtttac 1320 

actatccgcc gcactgttaa cataaagaag agtgtttatg ttataggcca tggagctctg 1380 

gtgcaggtgc aaaccgtcga ccgggtggcc tttagttgcg gtatgcaaaa tctgggcccc 1440 

ggggtgatag gcttaaatgg tgtaacattt cacaatgtaa ggtttactgg tgaaagtttt 1500 

aacggctctg tgtttgcaaa taacacacag ctgacgctcc acggcgttta cttttttaac 1560 

tttaataaca catgtgtgga gtcgtggggc agggtgtctt tgaggggctg ctgttttcac 1620 

ggctgctgga aggcggtggt gggaagactt aaaagtgtaa catctgtaaa aaaatgcgtg 1680 

tttgagcggt gtgtgttggc tttaactgtg gagggctgtg gacgcattag gaataatgcg 1740 
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y L. y LL.Lya.ya. 


ct l y y ct l y lll 


LLLLLLyLL-Cl 


aaaggcaegg 


ctagtattaa 


gcataacatg 


1800 


a ta L y Ly y Let 


rr rrrrrt* efcert a 

y l« y w * — c- y C Q. 


L. L-rf CI. . ^ 


c tgttaactt 


acaccfcfatcfcf 


aaactgtcag 


1860 


a Z" 1 1" 1 "t~ fT f~" f~f C~* P\ 

aOU L uy LyLa 


fTTri - rr ^ rat" 

1 . ( . U UU O Ct 1 — • Ct I— 


~~~tCfC'CT\~C~C'CPi C 

CtyLy L.LLLCIL 


cagcgccgcg 


cctggccaac 


attcgagcac 


1920 


aa. L. d L y lluq 


1" rr c rr 1 t cr t cr c 
i — y 1 — ■ y l l y y 


cgt cca ct t g 


ggcect aggc 


gaggcgtgtt 


tgtgccttac 


1980 


O d LJ L y LdaU L 


L L Ct y L Let 1— Ct L 


C~ Pi ^ CI ?\ \~ ~\~ \~ t~ 
L ct ct y CL 1— ^. — I — I — Ci 


ctaaaaccta 

C-* C* Ci \J Ci Ci V—. w u y 


at acctt ct c 


tcgagtgtgt 


2040 


1~"t"r*?5a't"Cfcrcfcr 

c. c 1 — • a. d > — y y y y 


tgtttgacat 


gt caatggaa 


ctgtttaaag 


tgataagata 


tgatgaatcc 


2100 


ct ct y LLULy i— L 


y Luy ui-.La uy 


tcraatcrccrcfa 

1 — y ct a. y 1 — • y y ct 


gctaat cat c 


tgaggttgta 


tcctgtaacc 


2160 


L L ci ct ct L y L La 


prrfarrrfanrl - 
* ■ 1 — • y ct y y ct y ' — - c- 


era cfcraccrcrafc 

y Ci j ci y \J ci 


caeca cat gt 


tgtcctgcct 


gcgcaccgac 


2220 


L ci Ly ad LUL-a 


y l y ct l. »y ct y y ct 


n"t" rrarrn't'Cfacf 

y y et y y y ci 


gggeggagee 


acaaagggta 


taaaaaacfccr 


2280 




y~f 4— 4— /~f "H /t i~~ 

y Ly Ly ct Ly ct l 


l l ct ct ct ct Lycty 


rrrrrrra c a Pi ccr 

Ly y y ci l y ci l y 


gaeggcaacg 


cgttt gaggg 


2340 


Lyycty l y l l l 


CttjuL.Lt LCI L L 


uy ci l ca. 1 — • 1 — >j 


tcttccttcc 


t gggcaggag 


tgcgtcagaa 


2400 


4- rr "f~ 3 rr "h rr rt a c 
l y l ct y Ly yy l 


i~ c c p~ c~icicit~.es a 


acggacgacc 


ggt cgcccct 


gcaaatt ccg 


ccaccctcac 


2460 


/~i 4- 4- rr r~* 3 r" 1 1"* 
L Let L y I • *• . Ct L- 


n"H rr rrrrs i~ r*a i" 
y y y y ^ q. <— 


rciffcrcfacac 

Ly c uy y ci »c* 


tgccgcggca 


gctgccgctt 


ctgctgccgc 


2520 


4— 4— /—i 4- -a f* 4— s~r ,—i 4- 
L LLLaLL^LL. 


Ly Ly y Let l y y 


r*rrn i r , i"rfat"tt 

Ly y l uy o. u l. c 


tcrcractcrtat 

c» y y ct ^ — • c* y o ci c 


aaccaactgg 


ccactgcagc 


2580 


4~~ T* /^f 4~" t" 

Ly Lyy Ly ull 


Ly y LLLLLyy 


ra 3 rfaa era 
L- l Let ct y ct ct y ct 


"r"rrr , (~ , r*"t~n'aa't" 

C y Vw< L C- V^j Ci. Ci c* 


gtgatcctga 


ct cgcctgga 


2640 


yatCaLyLCa 


/— i <«r 4— t~* r-r f~* 4— 4— r~r f~r 

Ly LLyLLLyy 


ctLyctctLLyy l 


~\~ rr c~ ci c~ ~~~i cipi i~~ p\ 
l> y l y Let y a l ct 


trccaacrcta 

'-- U. C*. Ci Ci U C- Ci 


accccgatac 


2700 


L.aL l LLciy d.a 


L- L L c ct ct ci a. a. 


Ci Ci y ci > — - ci ci ci *w*ci 


aatatgttga 


aaagtaaaat 


ggctttattt 


2760 


„ 4- 4- 4- -h ■{- -h +- -h rf 

y LLLLLLLLy 


yLLLyy Lay y 


n't'ccrcrcftccia 

L C ^ — k — j '■-j "wj C. U U- Ci 


cct gt ct egg 


tcgttaagaa 


ctttgtgtat 


2820 


gttttccaaa 


acaeggtaca 


aataaactta 


gatgttcaag 


tacatgggca 


tgaggecate 


2880 


tttcrcrcfcrtcra. 

C- C- C- y ^ ^ C- Ci 


agataggacc 


attgaagagc 


gtcatgctcc 


ggggtggtgt 


tgtaaattac 


2940 


i^uau L>^y c ci y 


cac/crcrtttct 


gggcgtggaa 


ctggaagatg 


tcctttagga 


gtaggctgat 


3000 


yy LLet cty y y l 


ctyyLLLL ct y 


frit" a cr cr t crtt 

C- C. Ci ^ N-j C- V-j C 


tacaaagegg 


ttaagctggg 


agggatgeat 


3060 


>"t i^r /^f /T /'w j^f ^ 


4— r-t a I- a !■ /-r /— i u 
d Ly d Ld Ly Let 


lll Ly y ll c. y 


ctPi'\~~c~\~~~]~~c~~PiCiC{ 

y ct c l Lyay y 


ttacrctatcrt 


taccacccag 


3120 


y LLLLuy Lyy 


yyy l l Let l y l 


+*3"t~cj , aacj'crac: 

c ci c y a. ci. y y ci u 


ca ccagca eg 


at eft a crc cct cf 


tgcatttggg 


3180 


y ct ct l l l y u- > — ■ ct 


1~ a c f~ cii~ ~\~,~\r.cs ci 

i-. y Let y ^yy 


acrcfcfcraacrcrc 

t-4 y »w| ^ \£ ^ y v *~* 


gtggaagaat 


ttagagaccc 


ccttgtggcc 


3240 


ccctaggttt tccatgcact catccataat gatggcaatg ggacccctgg cggccgcttt 3300 

ggcaaacacg ttttgggggt tggaaacatc atagttttgc tctagagtga gctcatcata 3360 

ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 3420 

gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgagggggg 3480 

tatcatgtcc acctgggggg caataaagaa cacggtttct ggcgggggat tgatgagctg 3540 

ggtggaaagc aagttacgca gcagttgaga tttgccacag ccggtggggc cgtagatgac 3600 

cccgatgacg ggttgcagct ggtagttgag agaggaacag ctgccgtcgg ggcgcaggag 3660 

gggggctacc tcattcatca tgcttctaac atgtttattt tcactcacta agttttgcaa 3720 

gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcagcggttt 3780 

taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccca 3840 

gagctcggtg acgtgctcta tggaatctcg atccagcaga cttcttggtt gcgggggttg 3900 

ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggttctgtcc 3960 

ttccagggtc tcagcgtccg ggtgagggtg gtctcggtga cggtgaaggg atgagccccg 4020 

ggctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 4080 

tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 4140 

gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 4200 

cattggaggg cgtagagttt gggggccagg aagaccgact cgggcgagta ggcgtcggct 4260 

ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 4320 

tcaaaaacca agttgcctcc attttttttg atgcgtttct taccttgcgt ttccatgagt 4380 

ttgtggcccg cttccgtgac aaaaaggctg tcggtgtctc cgtagacaga cttgaggggg 4440 

cgatcttcca aaggtgttcc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500 

aaggctctgg tccaggctaa cacgaaggag gcaatctgcg aggggtatct gtcgttttca 4560 

atgagggggt ccaccttttc cagggtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620 

aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680 

ggggcgtgcc caccctctcc gtcactttct tccgcatcgc tgtggaccag agccagctgt 4740 

tcgggtgagt aggccctctc aaaagccggc atgatctcgg cgctcaagtt gtcagtttct 4800 

acaaacgagg tggatttgat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860 

tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920 

agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttctc ccgatcggcc 4980 

ctctccttgg cggcgatgtt gagttgcacg tactcccggg ccgcgcaccg ccactcgggg 5040 

aacacggcgg tgcgctcgtc gggcaggatg cgcacgcgcc agccgcgatt gtgcagggtg 5100 

atgaggtcca cgctggtagc cacctccccg cggaggggct cgttggtcca acacaatcgc 5160 
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r* r* r* f i~ i~ 


+~ fTfra rrra rraa 
uy y c*. y < — y c* a 


rrrcracfcfcacfcf 


ggat ctagca 


aattcr qcqqq 

«-*y Wfc, *3i3 333 


cqqgqqgtcq 

ZJ ZJ ZJ Z> ZJ ZJ ZJ 


5220 


rfr , cr , hr'*rra1~rrrf 
y i — • y *^ y a »— y y 


tgaagatacc 


crcrcrtacf cacfer 

yyy ccty ^ci y ^3 


at cttattaa 


aataatcgat 


ttcggtgtcc 


5280 


U 1 — y 1— >_ 1— I— y * — • C4. 


acgcgtcttc 


ccactt cttc 


accgccaggg 


ccctttcgta 


gggattcagg 


5340 


cr cr ccf eft cccc 

y y ^— - y y 1 — - v-» w 


acrcfcrcatcfcfLT 
a y y y ^-c* ^-yyy 


Cf t CT CT Cf t c a Cf CT 


acccracTQCCft 


acatgccgca 


gatgtcatac 


5400 


dL-Lj Let Lay y y 


y L L L L L LL.au 


(-> rrrccfat cr 

a 1 — - v_» • y ci c y 


taacrtaacrCTt 


aacagcgccc 


cccgcggatg 


5460 


L L y y L LLLj Let 


L y Lay LL-y La 


paf- ct CCf CCf C 


gagggageca 


tgaggccgtc 


tcccaagtgg 


5520 


y ULLLy Ly y y 


y LLLLLLyyL 


c rrr rri - a era cr rf 
LLyy a y a y y 


atctatctaa 


aaatacrcatcf 

*-* y « y ^3 zy zs 


ggagttggaa 


5580 


y ay a Ly y Ly y 


y y Ly LLyyaa 


cfaccfttaaacr 

y cl ^w. y c • cl cl cl y 


ttggccccgg 


gtagt cccac 


qqaqtcttqg 

ZJ ZJ ZJ w 


5640 


a t~cra a r*t crcf cr 

cl ' — y a a 1 — • i— y y y 


egtaggatte 


ccggagtttg 


tccaccaggg 


cqqcggtcac 

ZJ ZJ ZJ -J 


cagcacgtcg 


5700 


cty ay ^y Lay l 


acftccaaccf t 


ctcgcggacc 


a era 1 1 cr t a ctq 


ccgtctcttg 


ttttttctcc 


5760 


facarft^'hrTrr* 

^ — cl \ — . a y i — \ — y ^ 


crcrttcfacfcracf 

yy i, uyay yay 


gta tt cct eg 


cggtctttcc 


agtactcttc 
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agtceggega 


ggagagaccc 


6300 
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tcctccttct 


ctgggtggac 


gacggtgggg 


acgacgacgc 


cccgggtgcc 


gcaagtccag 


6960 


atctccgcca 


cggaggggtg 


caggcgctgc 


aggaggggac 


gcagctgccc 


gctgtccagg 


7020 


gagtcgaggg 


aagtcgcgct 


gaggtcggcg 


ggaagcgttt 


gcaagttcac 


tttcagaaga 


7080 


ccggtaagag 


cgtgagccag 


gtgcagatgg 


tacttgattt 


ccaggggggt 


gttggatgaa 


7140 


gcgtccacgg 


cgtagaggag 


tccgtgtccg 


cgcggggcca 


ccaccgtgcc 


ccgaggaggt 


7200 


tttatctcac 


tcgtcgaggg 


cgagcgccgg 


ggggtagagg 


cggctctgcg 


ccggggggca 


7260 


gcggaggcag 


aggcacgttt 


tcgtgaggat 


tcggcagcgg 


ttgatgacga 


gcccggagac 


7320 


tgctggcgtg 


ggcgacgacg 


cggcggttga 


ggtcctggat 


gtgccgtctc 


tgcgtgaaga 


7380 


ccaccggccc 


ccgggtcctg 


aacctaaaga 


gagttccaca 


gaatcaatgt 


ctgcatcgtt 


7440 


aacggcggcc 


tgcctgagga 


tctcctgcac 


gtcgcccgag 


ttgtcctgat 


aggcgatctc 


7500 


ggccatgaac 


tgttccactt 


cttcctcgcg 


gaggtcaccg 


tggcccgctc 


gctccacggt 


7560 


ggcggccagg 


tcgttggaga 


tgcggcgcat 


gagttgagag 


aaggcgttga 


ggccgttctc 


7620 


gttccacacg 


cggctgtaca 


ccacgtttcc 


gaaggagtcg 


cgcgctcgca 


tgaccacctg 


7680 


ggccacgttg 


agttccacgt 


ggcgggcgaa 


gacggcgtag 


tttctgaggc 


gctggaagag 


7740 


gtagttgagc 


gtggtggcga 


tgtgctcgca 


gacgaagaag 


tacataatcc 


agcgccgcag 


7800 


ggtcatctcg 


ttgatgtctc 


cgatggcttc 


gagacgctcc 


atggcctcgt 


agaagtcgac 


7860 


ggcgaagttg 


aaaaattggg 


agttgcgggc 


ggccaccgtg 


agttcttctt 


gcaggaggcg 


7920 


gatgagatcg 


gcgaccgtgt 


cgcgcacctc 


ctgttcgaaa 


gcgccccgag 


gcgcctctgc 


7980 


ttcttcctcc 


ggctcctcct 


cttccagggg 


ctcgggttcc 


tccggcagct 


ctgcgacggg 


8040 


gacggggcgg 


cgacgtcgtc 


gtctgaccgg 


caggcggtcc 


acgaagcgct 


cgatcatttc 


8100 


gccgcgccgg 


cgacgcatgg 


tctcggtgac 


ggcgcgtccg 


ttttcgcgag 


gtcgcagttc 


8160 


gaagacgccg 


ccgcgcagag 


cgcccccgtg 


cagggagggt 


aagtggttag 


ggccgtcggg 


8220 


cagggacacg gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga

tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc 8340

gcaatcgcaa ggtaagctga gaacggtggg tcgctggggg gcgttcgcgg gcagttggga 8400


ggtgatgctg ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag

gaggaccacg tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc 8520

ctcgctctga cagcgacgca ggtctttgta gaagtcttgc atcagtctct ccaccggaac 8580

gaatcgttgg aagacgtacg cccaggagca gcgggagtgg gaggacagtc agccgcggcc 12060

gctggtaccg ccgcattggc gtcgccagag agaagacccg gacgactccg cagacgatag 12120 

tagcgtgttg gacctgggag ggagcggagc caaccccttt gctcacttgc aacccaaggg 12180 

gcgctcgagt cgcctgtatt aataaaaaag acgcggaaac ttaccagagc catggccaca 12240 

gcgtgtgtgc tttcttcctc tctttcttcc tcggcgcggc aga atg aga aga gcg 12295 

Met Arg Arg Ala 
1 

gtg aga gtc acg ccg gcg gcg tat gag ggc ccg ccc cct tct tac gaa 12343 
Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro Pro Ser Tyr Glu 
5 10 15 20 

agc gtg atg gga tca gcg aac gtg ccg gcc acg ctg gag gcg cct tac 12391
Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu Glu Ala Pro Tyr
25 30 35

gtt cct ccc aga tac ctg gga cct acg gag ggc aga aac agc atc cgt 12439
Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg Asn Ser Ile Arg
40 45 50

tac tcc gag ctg gcg ccc ctg tac gat acc acc aag gtg tac ctg gtg 12487
Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys Val Tyr Leu Val
55 60 65

gac aac aag tcg gcg gac atc gcc tcc ctg aat tac caa aac gat cac 12535
Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr Gln Asn Asp His
70 75 80

agt aac ttt ctg act acc gtg gtg cag aac aat gac ttc acc ccg acg 12583
Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp Phe Thr Pro Thr
85 90 95 100

gag gcg ggc acg cag acc att aac ttt gac gag cgt tcc cgc tgg ggc 12631
Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu Arg Ser Arg Trp Gly
105 110 115

ggt cag ctg aaa acc atc ctg cac acc aac atg ccc aac atc aac gag 12679
Gly Gln Leu Lys Thr Ile Leu His Thr Asn Met Pro Asn Ile Asn Glu
120 125 130

ttc atg tcc acc aac aag ttc agg gct aag ctg atg gta gaa aaa agt 12727
Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met Val Glu Lys Ser
135 140 145

aat gcg gaa act cgg cag ccc cga tac gag tgg ttc gag ttt acc att 12775
Asn Ala Glu Thr Arg Gln Pro Arg Tyr Glu Trp Phe Glu Phe Thr Ile
150 155 160

cca gag ggc aac tat tcc gaa act atg act atc gat ctc atg aat aac 12823
Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp Leu Met Asn Asn
165 170 175 180

gcg atc gtg gac aat tac ctg caa gtg ggg aga cag aac ggg gtg ctg 12871
Ala Ile Val Asp Asn Tyr Leu Gln Val Gly Arg Gln Asn Gly Val Leu
185 190 195

gaa agc gat atc ggc gtg aaa ttc gat acc aga aac ttc cga ctg ggg 12919
Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly
200 205 210

tgg gat ccc gtg acc aag ctg gtg atg cca ggc gtg tac acc aac gag 12967
Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val Tyr Thr Asn Glu
215 220 225

gct ttt cac ccg gac atc gtg ctg ctg ccg ggg tgc ggt gtg gac ttc 13015
Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys Gly Val Asp Phe
230 235 240

act cag agc cgt ttg agt aac ctg tta gga att aga aag cgc cgc ccc 13063
Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg Lys Arg Arg Pro
245 250 255 260

ttc caa gag ggc ttt caa atc atg tat gag gac ctg gag gga ggt aat 13111
Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu Glu Gly Gly Asn
265 270 275

ata ccc gcc tta ctg gac gtg tcg aag tac gaa gct agc ata caa cgc 13159
Ile Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala Ser Ile Gln Arg
280 285 290

gcc aaa gcg gag ggt aga gag att cgg gga gac acc ttt gcg gta gct 13207
Ala Lys Ala Glu Gly Arg Glu Ile Arg Gly Asp Thr Phe Ala Val Ala
295 300 305

ccc cag gac ctg gaa ata gtg cct tta act aaa gac agc aaa gac aga 13255
Pro Gln Asp Leu Glu Ile Val Pro Leu Thr Lys Asp Ser Lys Asp Arg
310 315 320

agc tac aat att ata aac aac acg acg gac acc ctg tat cgg agc tgg 13303
Ser Tyr Asn Ile Ile Asn Asn Thr Thr Asp Thr Leu Tyr Arg Ser Trp
325 330 335 340

ttt ctg gct tac aac tac gga gac ccc gag aaa gga gtg aga tca tgg 13351
Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg Ser Trp
345 350 355

acc ata ctc acc acc acg gac gtg acc tgt ggc tcg cag caa gtg tac 13399
Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser Gln Gln Val Tyr
360 365 370

tgg tcc ctg ccg gat atg atg caa gac ccg gtc acc ttc cgc ccc tcc 13447
Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr Phe Arg Pro Ser
375 380 385

acc caa gtc agc aac ttc ccg gtg gtg ggc acc gag ctg ctg ccc gtc 13495
Thr Gln Val Ser Asn Phe Pro Val Val Gly Thr Glu Leu Leu Pro Val
390 395 400

cat gcc aag agc ttc tac aac gag cag gcc gtc tac tcg caa ctt att 13543
His Ala Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr Ser Gln Leu Ile
405 410 415 420

cgc cag tcc acc gcg ctt acc cac gtg ttc aat cgc ttt ccc gag aac 13591
Arg Gln Ser Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn
425 430 435

cag att ctg gtg cgc cct ccc gct cct acc att acc acc gtc agt gaa 13639
Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val Ser Glu
440 445 450

aac gtt ccc gcc ctc aca gat cac gga acc ctg ccg ctg cgc agc agt 13687
Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Ser Ser
455 460 465

atc agt gga gtt cag cgc gtg acc atc acc gac gcc aga cgt cga acc 13735
Ile Ser Gly Val Gln Arg Val Thr Ile Thr Asp Ala Arg Arg Arg Thr
470 475 480

tgc ccc tac gtt tac aaa gcg ctt ggc gtg gtg gct cct aaa gtt ctt 13783
Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala Pro Lys Val Leu
485 490 495 500

tct agt cgc acc ttc taa aaacatgtcc atcctcatct ctcccgataa 13831
Ser Ser Arg Thr Phe
505



caacaccggc tggggactgg gctccggcaa gatgtacggc ggagccaaaa ggcgctccag 13891

tcagcaccca gttcgagttc ggggccactt ccgcgctcct tggggagctt acaagcgagg 13951

actctcgggt cgaacggctg tagacgatac catagatgcc gtgattgccg acgcccgccg 14011

gtacaacccc ggaccggtcg ctagcgccgc ctccaccgtg gattccgtga tcgacagcgt 14071

ggtagccggc gctcgggcct atgctcgccg caagaggcgg ctgcatcgga gacgtcgccc 14131

caccgccgcc atgctggcag ccagggccgt gctgaggcgg gcccggaggg caggcagaag 14191

ggctatgcgc cgcgctgccg ccaacgccgc cgccgggagg gcccgccgac aggctgcccg 14251

ccaggctgcc gctgccatcg ctagcatggc cagacccagg agagggaacg tgtactgggt 14311

gcgtgattct gtgacgggag tccgagtgcc ggtgcgcagc cgacctcccc gaagttagaa 14371

gatccaagct gcgaagacgg cggtactgag tctccctgtt gttatcagcc caacatgagc 14431

aagcgcaagt ttaaagaaga actgctgcag acgctggtgc ctgagatcta tggccctccg 14491

gacgtgaagc cagacattaa gccccgcgat atcaagcgtg ttaaaaagcg ggaaaagaaa 14551

gaggaactcg cggtggtaga cgatggcgga gtggaattta ttaggagttt cgccccgcga 14611

cgcagggttc aatggaaagg gcggcgggta caacgcgttt tgaggccggg caccgcggta 14671

gtttttaccc cgggagagcg gtcggccgtt aggggtttca aaaggcagta cgacgaggtg 14731

tacggcgacg aggacatatt ggaacaggcg gctcaacaga tcggagaatt tgcctacgga 14791

aagcgttcgc gtcgcgaaga cctggccatc gccttagaca gcggcaaccc cacgcccagc 14851

ctcaaacccg tgacgctgca gcaggtgctt cccgtgagcg ccagcacgga cagcaagagg 14911

gggattaaga gagaaatgga agatctgcat cccaccatcc aactcatggt ccctaaacgg 14971

cagaggctgg aagaggtcct ggagaagatg aaagtggacc ccagcataga gccggatgta 15031

aaagtcagac ctattaagga agtggccccc ggtcttgggg tgcaaacggt ggacattcaa 15091

atccccgtca ccaccgcttc aaccgccgtg gaagctatgg aaacgcaaac ggagacccct 15151

gccgcgatcg gtaccaggga agtggcgttg caaacggagc cttggtacga atacgcagcc 15211

cctcggcgtc agaggcgttc cgctcgttac ggccccgcca acgccatcat gccagaatat 15271

gcgctgcatc cgtctattct gcccactccc ggataccggg gtgtgacgta tcgcccgtct 15331

ggaacccgcc gccgaacccg tcgccgccgc cgctcccgtc gcgctctggc ccccgtgtcg 15391

gtgcggcgtg tgacccgccg gggaaagaca gtcgtcattc ccaacccgcg ttaccaccct 15451

agcatccttt aataactctg ccgttttgca gatggctctg acttgccgcg tgcgccttcc 15511

cgttccgcac tatcgaggaa gatctcgtcg taggagaggc atgacgggca gtggtcgccg 15571

gcgggctttg cgcaggcgca tgaaaggcgg aattttaccc gccctgatac ccataattgc 15631

cgccgccatc ggtgccatac ccggcgttgc ttcagtggcg ttgcaagcag ctcgtaataa 15691

ataaacaaag gcttttgcac ttatgacctg gtcctgacta ttttatgcag aaagagcatg 15751

gaagacatca attttacgtc gctggctccg cggcacggct cgcggccgct catgggcacc 15811

tggaacgaca tcggcaccag tcagctcaac gggggcgctt tcaattgggg gagcctttgg 15871

agcggcatta aaaactttgg ctccacgatt aaatcctacg gcagcaaagc ctggaacagt 15931

agtgctggtc agatgctccg agataaactg aaggacacca acttccaaga aaaagtggtc 15991

aatggggtgg tgaccggcat ccacggcgcg gtagatctcg ccaaccaagc ggtgcagaaa 16051

gagattgaca ggcgtttgga aagctcgcgg gtgccgccgc agagagggga tgaggtggag 16111

gtcgaggaag tagaagtaga ggaaaagctg cccccgctgg agaaagttcc cggtgcgcct 16171

ccgagaccgc agaagcggcc caggccagaa ctagaagaga ctctggtgac ggagagcaag 16231

gagcctccct cgtacgagca agccttgaaa gagggcgcct ctccaccctc ctacccgatg 16291

actaagccga tcgcacccat ggctcgaccg gtgtacggca aggattacaa gcccgtcacg 16351

ctagagctgc ccccaccgcc ccccacgcgc ccgaccgtcc cccccctgcc gactccgtcg 16411

gcggccgcgg cgggacccgt gtccgcacca tccgctgtgc ctctgccagc cgcccgtcca 16471

gtggccgtgg ccactgccag aaaccccaga ggccagagag gagccaactg gcaaagcacg 16531 

ctgaacagca tcgtgggcct gggagtgaaa agcctgaaac gccgccgttg ctattattaa 16591 

aaaagtgtag ctaaaaagtc tcccgttgta tacgcctcct atgttaccgc cagagacgag 16651 

tgactgtcgc cgcgagcgcc gctttcaag atg gcc acc cca tcg atg atg ccg 16704

Met Ala Thr Pro Ser Met Met Pro
510

cag tgg tct tac atg cac atc gcc ggc cag gac gcc tcg gag tac ctg 16752
Gln Trp Ser Tyr Met His Ile Ala Gly Gln Asp Ala Ser Glu Tyr Leu
515 520 525



agt ccc ggc ctc gtg cag ttt gcc cgc gcc acc gac acc tac ttc agc 16800
Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Asp Thr Tyr Phe Ser
530 535 540 545

ttg gga aac aag ttt aga aac ccc acc gtg gcc ccc acc cac gat gtg 16848
Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro Thr His Asp Val
550 555 560

acc acg gac cgc tcg cag agg ctg acc ctg cgc ttt gtg ccc gta gac 16896
Thr Thr Asp Arg Ser Gln Arg Leu Thr Leu Arg Phe Val Pro Val Asp
565 570 575

cgg gag gac acc gcg tac tct tac aaa gtg cgc tac acg ttg gcc gta 16944
Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Tyr Thr Leu Ala Val
580 585 590

ggg gac aac cga gtg ctg gac atg gcc agc acc tac ttt gac atc cgg 16992
Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr Phe Asp Ile Arg
595 600 605

ggg gtg ctg gat cgg ggt ccc agc ttc aag ccc tat tcc ggc acc gct 17040
Gly Val Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr Ser Gly Thr Ala
610 615 620 625

tac aac tcc ctg gcc ccc aag gga gct ccc aac ccc tcg gaa tgg acg 17088
Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Pro Ser Glu Trp Thr
630 635 640

gac act tcc gac aac aaa ctt aaa gca tat gct cag gct ccc tac cag 17136
Asp Thr Ser Asp Asn Lys Leu Lys Ala Tyr Ala Gln Ala Pro Tyr Gln
645 650 655

agt caa gga ctt aca aag gat ggt att cag gtt ggg cta gtt gtg aca 17184
Ser Gln Gly Leu Thr Lys Asp Gly Ile Gln Val Gly Leu Val Val Thr
660 665 670

gag tca gga caa aca ccc caa tat gca aac aaa gtg tac caa ccc gag 17232
Glu Ser Gly Gln Thr Pro Gln Tyr Ala Asn Lys Val Tyr Gln Pro Glu

675 680 685

cca caa att ggg gaa aac caa tgg aat tta gaa caa gaa gat aaa gcg 17280
Pro Gln Ile Gly Glu Asn Gln Trp Asn Leu Glu Gln Glu Asp Lys Ala
690 695 700 705

gcg gga aga gtc cta aag aaa gat acc cct atg ttt ccc tgc tat ggg 17328
Ala Gly Arg Val Leu Lys Lys Asp Thr Pro Met Phe Pro Cys Tyr Gly
710 715 720

tca tat gcc agg ccc aca aac gaa caa gga ggg cag gca aaa aac caa 17376
Ser Tyr Ala Arg Pro Thr Asn Glu Gln Gly Gly Gln Ala Lys Asn Gln
725 730 735

gaa gta gat tta cag ttt ttt gcc act ccg ggc gac acc cag aac acg 17424
Glu Val Asp Leu Gln Phe Phe Ala Thr Pro Gly Asp Thr Gln Asn Thr
740 745 750

gct aaa gtg gta ctt tat gct gaa aat gtc aac ctg gaa act cca gat 17472
Ala Lys Val Val Leu Tyr Ala Glu Asn Val Asn Leu Glu Thr Pro Asp
755 760 765

act cac tta gtg ttt aaa ccc gat gac gac agc acc agt tca aaa ctt 17520
Thr His Leu Val Phe Lys Pro Asp Asp Asp Ser Thr Ser Ser Lys Leu
770 775 780 785

ctt ctt ggg cag cag gct gca cct aac aga ccc aac tac ata ggt ttt 17568
Leu Leu Gly Gln Gln Ala Ala Pro Asn Arg Pro Asn Tyr Ile Gly Phe
790 795 800

aga gat aat ttt att ggt tta atg tac tac aat agc act gga aac atg 17616
Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser Thr Gly Asn Met
805 810 815

ggc gtg ctg gcc gga cag gct tct caa ttg aat gcc gta gtc gac ttg 17664
Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala Val Val Asp Leu
820 825 830

cag gac aga aac acc gag ttg tcc tac cag ctg atg ctg gac gca ctg 17712
Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met Leu Asp Ala Leu
835 840 845

ggg gat cgc agc cga tat ttt tca atg tgg aat cag gca gta gac agc 17760
Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gln Ala Val Asp Ser
850 855 860 865

tat gac cca gac gtt aga att ata gaa aac cac gga gtg gaa gac gaa 17808
Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly Val Glu Asp Glu
870 875 880

ctg cca aac tat tgt ttt cct ctg gga gga atg gtg gtg act gac aat 17856
Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Met Val Val Thr Asp Asn
885 890 895

tac aac tct gtg acg cct caa aat gga ggc agt gga aat aca tgg cag 17904
Tyr Asn Ser Val Thr Pro Gln Asn Gly Gly Ser Gly Asn Thr Trp Gln
900 905 910

gca gac aat act aca ttt agt caa aga gga gcg cag att ggc tcc gga 17952
Ala Asp Asn Thr Thr Phe Ser Gln Arg Gly Ala Gln Ile Gly Ser Gly
915 920 925

aac atg ttt gcc ctg gaa att aac cta cag gcc aac ctc tgg cgc ggc 18000
Asn Met Phe Ala Leu Glu Ile Asn Leu Gln Ala Asn Leu Trp Arg Gly
930 935 940 945

ttc ttg tat tcc aat att ggg ttg tat ctt cca gac tct ctg aaa atc 18048
Phe Leu Tyr Ser Asn Ile Gly Leu Tyr Leu Pro Asp Ser Leu Lys Ile
950 955 960

acc ccc gac aac atc acg ctg cca gaa aac aaa aac act tat cag tac 18096
Thr Pro Asp Asn Ile Thr Leu Pro Glu Asn Lys Asn Thr Tyr Gln Tyr
965 970 975

atg aac ggt cgc gta acg cca ccc ggg ctc ata gac acc tat gta aac 18144
Met Asn Gly Arg Val Thr Pro Pro Gly Leu Ile Asp Thr Tyr Val Asn
980 985 990

gtg ggc gcg cgc tgg tcc ccc gat gtc atg gac agc att aac ccc ttc 18192
Val Gly Ala Arg Trp Ser Pro Asp Val Met Asp Ser Ile Asn Pro Phe
995 1000 1005

aac cac cac cgt aac gcg ggc ttg cgc tac cgc tcc atg ctc ttg 18237
Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu
1010 1015 1020

ggc aac ggc cgt tat gtg cct ttt cac att cag gtg ccc caa aaa 18282
Gly Asn Gly Arg Tyr Val Pro Phe His Ile Gln Val Pro Gln Lys
1025 1030 1035

ttc ttt gcc att aaa aac ctg ctg ctt ctc ccc ggt tcc tat acc 18327
Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr Thr
1040 1045 1050

tat gag tgg aac ttc cgc aag gat gtc aac atg atc ctg cag agc 18372
Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met Ile Leu Gln Ser
1055 1060 1065

tcg ctg ggt aat gac ctg cga gtg gac ggg gcc agc ata cgc ttt 18417
Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Ile Arg Phe
1070 1075 1080

gac agc att aac ctg tat gcc aac ttt ttt ccc atg gcc cac aac 18462
Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn
1085 1090 1095

acg gcc tct acc ctg gaa gcc atg ctg cgc aac gac acc aat gac 18507
Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp
1100 1105 1110

cag tcc ttc aac gac tac ctg tgc gcg gct aac atg ctg tac ccc 18552
Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro
1115 1120 1125

atc ccc gcc aac gcc acc agc gtg ccc att tct att cct tct cgg 18597
Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg
1130 1135 1140

aac tgg gct gcc ttc agg ggc tgg agt ttt act cgc ctc aaa acc 18642
Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr
1145 1150 1155

aag gag act ccc tcg ctg ggc tcc ggt ttt gac ccc tac ttt gtt 18687
Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val
1160 1165 1170

tac tcc ggc tcc att ccc tac cta gat ggc acc ttt tac ctc aac 18732
Tyr Ser Gly Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn
1175 1180 1185

cac act ttc aaa aag gtg tct att atg ttt gac tcc tcg gtt agc 18777
His Thr Phe Lys Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser
1190 1195 1200

tgg ccc ggc aac gac cgc ctg cta acg ccc aac gag ttc gaa att 18822
Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile
1205 1210 1215

aag cgt tcc gtg gac ggt gaa ggg tac aac gtg gcc cag agc aac 18867
Lys Arg Ser Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Ser Asn
1220 1225 1230

atg acc aag gac tgg ttt cta att caa atg ctc agt cac tat aat 18912
Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His Tyr Asn
1235 1240 1245

ata ggt tac cag ggc ttc tat gtg ccc gag aac tac aag gac cgc 18957
Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Asn Tyr Lys Asp Arg
1250 1255 1260

atg tac tcc ttc ttc cgc aac ttc caa cca atg agc cgg cag gtg 19002
Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln Val
1265 1270 1275

gta gat acc gtg act tat aca gac tac aaa gat gtc aag ctc ccc 19047
Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys Asp Val Lys Leu Pro
1280 1285 1290

tac caa cac aac aac tca ggg ttc gtg ggc tac atg gga ccc acc 19092
Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr
1295 1300 1305

atg cga gag gga cag gcc tac ccg gcc aac tat ccc tac ccc ctg 19137
Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu
1310 1315 1320

atc gga gag act gcc gta ccc agc ctc acg cag aaa aag ttc ctc 19182
Ile Gly Glu Thr Ala Val Pro Ser Leu Thr Gln Lys Lys Phe Leu
1325 1330 1335

tgc gac cgg gtg atg tgg agg ata ccc ttc tct agc aac ttt atg 19227
Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met
1340 1345 1350

tcg atg ggc tcc ctc acc gac ctg ggg cag aac atg ctg tac gcc 19272
Ser Met Gly Ser Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala
1355 1360 1365

aac tcc gct cac gcc ttg gac atg act ttt gag gtg gat ccc atg 19317
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met
1370 1375 1380

gat gag ccc acg ctt ctc tat gtt ctg ttt gaa gtc ttc gac gtg 19362
Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val
1385 1390 1395

gtg cgc atc cac cag ccg cac cgc ggc gtc atc gag gcc gtc tac 19407
Val Arg Ile His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr
1400 1405 1410

ctg cgc aca cct ttc tct gcc ggt aac gcc acc acc taa agaagctgat 19456
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr
1415 1420 1425



rtrtt t '1" c* nr* 

y y y l. Luoay 






t gttcgegae 


ctacractacq 


ggccctgctt 


19516 


tttgggcacc ttcgacaagc gttttcccgg attcatgtcc ccccacaagc cggcctgcgc 19576

catcgttaac acggccggac gggagacagg gggggtgcac tggctcgcct tcgcctggaa 19636

cccgcgcaac cgcacctgct acctgttcga cccttttggt ttctccgacg aaaggctgaa 19696

gcagatctac caattcgagt acgaggggct cctcaagcgc agcgctctgg cctccacgcc 19756

cgaccactgc gtcaccctgg aaaagtccac ccagacggtc caggggcccc tctcggccgc 19816

ctgcgggctt ttctgttgca tgtttttgca cgccttcgtg cactggcctc acacccccat 19876

ggagcgcaac cccaccatgg atctgctcac cggagtgccc aacagcatgc ttcacagtcc 19936

ccaggtcgcc cccaccctgc gtcgcaatca ggaccacctg tatcgctttc tggggaaaca 19996

ctctgcctat ttccgccgcc accggcagcg catcgaacag gccacggcct tcgaaagcat 20056

gagccaaaga gtgtaatcaa taaaaaccgt ttttatttga catgatacgc gcttctggcg 20116

tttttattaa aaatcgaagg gttcgaggga ggggtcctcg tgcccgctgg ggagggacac 20176

gttgcggtac tggaatcggg cgctccaacg aaactcgggg atcaccagcc gcggcagggc 20236

cacgtcttcc atgttctgct tccaaaactg tcgcaccagc tgcagggctc ccatcacgtc 20296

gggcgctgag atcttgaagt cgcagttagg gccggagccc ccgcggctgt tgcggaacac 20356

ggggttggca cactggaaca ccaacacgct ggggttgtgg atactagcca gggccgtcgg 20416

gtcggtcacc tccgatgcat ccagatcctc ggcattgctc agggcgaacg gggtcagctt 20476

gcacatctgc cgcccgatct ggggtaccag gtcgcgcttg ttgaggcagt cgcagcgcag 20536


agggatgagg atgcgacgct gcccgcgttg catgatgggg taactcgccg ccaggaactc 20596

ctctatctga cggaaggcca tctgggcctt gacgccctcg gtgaaaaata gcccacagga 20656

cttgctggaa aacacgttat tgccacagtt gatgtcttcc gcgcagcagc gcgcatcttc 20716

gttcttcagc tgaaccacgt tgcgacccca gcggttctga accaccttgg ctttcgtggg 20776

atgctccttc agcgcccgct gtccgttctc gctggtcaca tccatttcca ccacgtgctc 20836

cttgcagacc atctccactc cgtggaaaca gaacagaatg ccctcctgtt gggtattgcg 20896

atgctcccac acggcgcacc cggtggactc ccagctcttg tgtttcaccc ccgcgtaggc 20956

ttccatgtaa gccattagaa atctgcccat cagctcagtg aaggtcttct ggttggtgaa 21016

ggttagcggc aggccgcggt gttcctcgtt caaccaagtt tgacagatct tgcggtacac 21076

ggctccctgg tcgggcagaa acttaaaagt cgttctgctc tcgttgtcca cgtggaactt 21136

ctccatcaac atcgtcatga cttccatgcc cttctcccag gcagtcacca gcggcgcgct 21196

ctcggggttc ttcaccaaca cggcggtgga ggggccctcg ccggccccga cgtccttcat 21256

ggacattttt tgaaactcca cggtgccgtc cgcgcggcgt actctgcgca tcggagggta 21316

gctgaagccc acctccatga cggtgctttc gccctcgctg tcggagacga tctccgggga 21376

gggcggcgga acgggggcag acttgcgagc cttcttcttg ggagggagcg gaggcacctc 21436

ctgctcgcgc tcgggactca tctcccgcaa gtagggggtg atggagcttc ctggttggtt 21496

ctgacggttg gccattgtat cctaggcaga aagacatgga gcttatgcgc gaggaaactt 21556

taaccgcccc gtcccccgtc agcgacgaag aggtcatcgt cgaacaggac ccgggctacg 21616

ttacgccgcc cgaggatctg gaggggccct tagacgaccg gcgcgacgct agtgagcggc 21676

aggaaaatga gaaagaggag gaggagggct gctacctcct ggaaggcgac gttttgctaa 21736

agcatttcgc caggcagagc accatactca aggaggcctt gcaagaccgc tccgaggtgc 21796

ccttggacgt cgccgcgctc tcccaggcct acgaggcgaa ccttttctcg ccccgagtgc 21856

ctccgaagag acagcccaac ggcacctgcg agcccaaccc gcgactcaac ttctaccccg 21916

tgttcgccgt gcccgaggcg ctggccacct accacatctt tttcaaaaac cagcgcattc 21976

ccctttcctg ccgggccaac cgcaccgcgg ccgataggaa gctaacactc agaaacggag 22036

tcagcatacc tgatatcacg tcactggagg aagtgcctaa gatcttcgag ggtctgggtc 22096

gagatgagaa gcgggcggcg aacgctctgc agaaagaaca gaaagagagt cagaacgtgc 22156

tggtggagct ggagggggac aacgcgcgtc tgaccgtcct caaacgttgc atagaagttt 22216

cccacttcgc ctacccggcc ctcaacctgc cgcccaaagt tatgaaatcg gtcatggacc 22276

agctactcat caagagagct gagcccctga atcccgacca ccctgaggcg gaaaactcag 22336

aggacggaaa gcccgtcgtc agcgacgagg agctcgagcg gtggctggaa accagggacc 22396

cccagcagtt gcaagagagg cgcaagatga tgatggcggc cgtgctggtc acggtggagc 22456

tagaatgcct gcaacggttt ttcagcgacg tggagacgct acgcaaaatc ggggagtccc 22516

tgcactacac cttccgccag ggctacgttc gccaggcctg caaaatctcc aacgtagagc 22576

tcagcaacct ggtttcctac atgggcatcc tccacgagaa ccggctgggg cagagcgtgc 22636

tgcactgcac cttgcaaggc gaggcgcgaa gggactacgt ccgagactgc gtctacctct 22696

tcctcaccct cacctggcag accgccatgg gcgtgtggca gcagtgcttg gaagagagaa 22756

acctcaaaga gctggacaaa ctcctctgcc gccagcggcg ggccctctgg accggcttca 22816

gcgagcgcac ggtcgcctgc gccctggcag acatcatttt cccagaacgc ctgatgaaaa 22876

ccttgcagaa cggcctgccg gatttcatca gtcagagcat cttgcaaaac ttccgctcct 22936

tcgtcctgga gcgctccggg atcttgcccg ccatgagctg cgcgctgcct tctgactttg 22996

tccccctttc ctaccgcgag tgccctcccc cactgtggag ccactgctac ctcttccaac 23056

tggccaactt tctggcctac cactccgacc tcatggaaga cgtgagcgga gaggggctgc 23116

tcgagtgcca ctgccgctgc aacctctgca ccccccacag atcgctggcc tgcaacaccg 23176

agctgctcag cgaaacccag gtcataggta ccttcgagat ccaggggccc cagcagcaag 23236

agggtgcttc cggcttgaag ctcactccgg cgctgtggac ctcggcttac ttacgcaaat 23296

ttgtagccga ggactaccac gcccacaaaa ttcagtttta cgaagaccaa tctcgaccac 23356

cgaaagcccc cctcacggcc tgcgtcatca cccagagcaa aatcctggcc caattgcaat 23416

ccatcaacca agcgcgccga gatttccttt tgaaaaaggg tcggggggtg tacctggacc 23476

cccagaccgg cgaggaactc aacccgtcca cactttccgt cgaagcagcc cccccgagac 23536

atgccaccca agggaaccgc caagcagctg atcgctcggc agagagcgaa gaagcaagag 23596

ctgctccagc agcaggtgga ggacgaggaa gagctgtggg acagccaggc agaggaggtg 23656


tcagaggacg 


aggaggagafc 


ggaaagctgy
cgagcttt ca


23716 


gaggaagagg cgaccgaaga aaaaccacct gcatccagcg cgccttctct gagccgacag 23776

ccgaagcccc ggcccccgac gcccccggcc ggctcactca aagccagccg taggtgggac 23836

gccaccggat ctccagcggc agcggcaacg gcagcgggta aggccaaacg cgagcggcgg 23896

gggtattgct cctggcggac ccacaaaagc agtatcgtga actgcttgca acactgcggg 23956

ggaaacatct cctttgcccg acgctacctc ctcttccatc acggtgtggc cttccctcgc 24016 

aacgttctct attattaccg tcatctctac agcccctacg aaacgctcgg agaaaaaagc 24076 

taaggcctcc tctgccgcga ggaaaaactc cgccgccgct gccgccaagg atccgccggc 24136 

caccgaggag ctgagaaagc gcatctttcc cactctgtat gctatctttc agcaaagccg 24196 

cgggcagcac cctcagcgcg aactgaaaat aaaaaaccgc tccttccgct cactcacccg 24256 

cagctgtctg taccacaaga gagaagacca gctgcagcgc accctggacg acgccgaagc 24316 

actgttcagc aaatactgct cagcgtctct taaagactaa aagacccgcg ctttttcccc 24376 

ctcgggcgcc aaaacccacg tcatcgccag catgagcaag gagattccca ccccttacat 24436 

gtggagctat cagccccaga tgggcctggc cgcgggggcc gcccaggact actccagcaa 24496 

aatgaactgg ctcagcgccg gcccccacat gatctcacga gttaacggca tccgagccca 24556 

ccgaaaccag atcctcttag aacaggcggc aatcaccgcc acaccccggc gccaactcaa 24616 

cccgcccagt tggcccgccg cccaggtgta tcaggaaact ccccgcccga ccacagtcct 24676 

cctgccacgc gacgcggagg ccgaagtcct catgactaac tctggggtac aattagcggg 24736 

cgggtccagg tacgccaggt acagaggtcg ggccgctcct tactctcccg ggagtataaa 24796 

gagggtgatc attcgaggcc gaggtatcca gctcaacgac gaggcggtga gctcctcaac 24856 

cggtctcaga cctgacggag tcttccagct cggaggagcg ggccgctctt ccttcaccac 24916 

tcgccaggcc tacctgaccc tgcagagctc ttcctcgcag ccgcgctccg ggggaatcgg 24976 

cactctccag ttcgtggaag agttcgtccc ctccgtctac ttcaacccgt tttccggctc 25036 

acctggacgc tacccggacg ccttcattcc caactttgac gcagtgagtg aatccgtgga 25096 

cggctacgac tgatgacaga tggtgcggcc gtgagagctc ggctgcgaca tctgcatcac 25156 

tgccgccagc ctcgctgcta cgctcgggag gcgatcgtgt tcagctactt tgagctgccg 25216 

gacgagcacc ctcagggacc ggctcacggg ttgaaactcg agattgagaa cgcgcttgag 25276

tctcacctca tcgacgcctt caccgcccgg cctctcctgg tagaaaccga acgcgggatc 25336 

actaccatca ccctgttctg catctgcccc acgcccggat tac atg aag atc tgt 25391

Met Lys Ile Cys
1430

gtt gtc atc ttt gcg ctc agt tta ata aaa act gaa ctt ttt gcc 25436
Val Val Ile Phe Ala Leu Ser Leu Ile Lys Thr Glu Leu Phe Ala
1435 1440 1445 
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gta cct tea acg 
Val Pro Ser Thr 



gga gtc ctt aac 
Gly Val Leu Asn 



gaa aaa caa eta 
Glu Lys Gin Leu 



get aca gga cag 
Ala Thr Gly Gin 



aca cca ccc eta 
Thr Pro Pro Leu 



ggt tta aca tta 
Gly Leu Thr Leu 



caa ctt acg ttt 
Gin Leu Thr Phe 



agt ctt aca ttc 
Ser Leu Thr Phe 



agt ctt get att 
Ser Leu Ala lie 



aac ggc aca etc 
Asn Gly Thr Leu 



age tct ggt gee 
Ser Ser Gly Ala 



cca ace aaa act 
Pro Thr Lys Thr 



ctg ctg agt ctt 
Leu Leu Ser Leu 



age gtg agt tta 
Ser Val Ser Leu 



cca cgc gtt 
Pro Arg Val 
1450 

tta cac ata 
Leu His lie 
1465 

gee ctt tea 
Ala Leu Ser 
1480 

ttg gaa age 
Leu Glu Ser 
1495 

cgt ggt att 
Arg Gly He 
1510 

age tat tea 
Ser Tyr Ser 
1525 

aat ttt act 
Asn Phe Thr 
1540 

aac tac act 
Asn Tyr Thr 
1555 

aac gtc aat 
Asn Val Asn 
1570 

get gta aat 
Ala Val Asn 
1585 

tta act ttt 
Leu Thr Phe 
1600 

cct aac tgt 
Pro Asn Cys 
1615 

tgc ttg act 
Cys Leu Thr 
1630 

age gga gtg 
Ser Gly Val 
1645 



gtt tct cct 
Val Ser Pro 
1455 

gca aaa ccc 
Ala Lys Pro 
147 0 

att gga aaa 
He Gly Lys 
1485 

aca gca age 
Thr Ala Ser 
1500 

tec cct tta 
Ser Pro Leu 
1515 

gat ccc ctg 
Asp Pro Leu 
1530 

tct cca eta 
Ser Pro Leu 
1545 

tct ccc atg 
Ser Pro Met 
1560 

ace tec aaa 
Thr Ser Lys 
1575 

gtt act cca 
Val Thr Pro 
1590 

ggc ata caa 
Gly He Gin 
1605 

ace gtg ttt 
Thr Val Phe 
1620 

aaa tgc gga 
Lys Cys Gly 
1635 

gca gga ace 
Ala Gly Thr 
1650 



tgt gaa aaa ace 
Cys Glu Lys Thr 



ttg tat ttt ace 
Leu Tyr Phe Thr 



ggg tta aca att 
Gly Leu Thr He 



gta cag gac age 
Val Gin Asp Ser 



aag ctg aca gac 
Lys Leu Thr Asp 



cgt gtg gta ggt 
Arg Val Val Gly 



cgt tac gaa aat 
Arg Tyr Glu Asn 



aca eta ata aac 
Thr Leu He Asn 



ggc etc agt agt 
Gly Leu Ser Ser 



gat ttt aga ttt 
Asp Phe Arg Phe 



agt eta tgg act 
Ser Leu Trp Thr 



ace gaa agt gac 
Thr Glu Ser Asp 



get cac gta ctt 
Ala His Val Leu 



atg eta aaa atg 
Met Leu Lys Met 



cca 25481 

Pro 

1460 

ata 25526 

He 

1475 

tct 25571 

Ser 

1490 

get 25616 

Ala 

1505 

aac 25661 

Asn 

1520 

gac 25706 

Asp 

1535 

ggc 25751 

Gly 

1550 

aac 25796 

Asn 

1565 

gac 25841 

Asp 

1580 

aac 25886 

Asn 

1595 

ttt 25931 

Phe 

1610 

tec 25976 

Ser 

1625 

gga 26021 

Gly 

1640 

ace 26066 

Thr 

1655

cac act tct gtt acc gtt cag ttt tcg ttt gat gac agt ggt aaa 26111
His Thr Ser Val Thr Val Gln Phe Ser Phe Asp Asp Ser Gly Lys
1660 1665 1670

cta ata ttc tct cca ctt gcg aac aac act tgg ggt gtt cga caa 26156
Leu Ile Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly Val Arg Gln
1675 1680 1685

agc gag agt ccg ttg ccc aac cca tcc ttc aac gct ctc acg ttt 26201
Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu Thr Phe
1690 1695 1700

atg cca aac agt acc att tat tct aga gga gca agt aac gaa cct 26246
Met Pro Asn Ser Thr Ile Tyr Ser Arg Gly Ala Ser Asn Glu Pro
1705 1710 1715

caa aac aat tat tat gtc cag acg tat ctt aga ggc aac gtg cga 26291
Gln Asn Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Arg
1720 1725 1730

aag cca att cta cta act gtt acc tac aac tca gtt aat tca gga 26336
Lys Pro Ile Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly
1735 1740 1745

tat tcc tta act ttt aaa tgg gat gct gtc gcc aat gaa aaa ttt 26381
Tyr Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe
1750 1755 1760



gcc act cct aca tct tcg ttt tgc tat gtt gca gag caa taa ggc 26423
Ala Thr Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gln
1765 1770



aaccctgtta ccccaccgtc tcgttttttt cag atg aaa cga gcg aga gtt 26474 

Met Lys Arg Ala Arg Val 
1775 

gat gaa gac ttc aac cca gtg tac cct tat gac ccc cca tac gct 26519
Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr Asp Pro Pro Tyr Ala
1780 1785 1790

ccc gtc atg ccc ttc att act ccg cct ttt acc tcc tcg gat ggg 26564
Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr Ser Ser Asp Gly
1795 1800 1805

ttg cag gaa aaa cca ctt gga gtg tta agt tta aac tac agg gat 26609
Leu Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Asn Tyr Arg Asp
1810 1815 1820

ccc att act aca caa aat ggg tct ctc acg tta aaa cta gga aac 26654
Pro Ile Thr Thr Gln Asn Gly Ser Leu Thr Leu Lys Leu Gly Asn
1825 1830 1835



ctc act cta aac aac cag gga cag tta aca tca act gct ggc 26699
Gly Leu Thr Leu Asn Asn Gln Gly Gln Leu Thr Ser Thr Ala Gly
1840 1845 1850

gaa gtg gag cct ccg ctc act aat gct aac aac aaa ctt gca cta 26744
Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu
1855 1860 1865

gcc tat agc gaa cca tta gca gta aaa agc aac cgc cta act cta 26789
Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu
1870 1875 1880

tca cac acc gct ccc ctt gtc atc gct aat aat tct tta gcg ttg 26834
Ser His Thr Ala Pro Leu Val Ile Ala Asn Asn Ser Leu Ala Leu
1885 1890 1895

caa gtt tca gag cct att ttt gta aat gac gat gac aag cta gcc 26879
Gln Val Ser Glu Pro Ile Phe Val Asn Asp Asp Asp Lys Leu Ala
1900 1905 1910

ctg cag aca gcc gcc ccc ctt gta acc aac gct ggc acc ctt cgc 26924
Leu Gln Thr Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg
1915 1920 1925

tta cag agc gct gcc cct tta gga ttg gtt gaa aat act ctt aaa 26969
Leu Gln Ser Ala Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys
1930 1935 1940

ctg ctg ttt tct aaa ccc ttg tat ttg caa aat gat ttt ctt gca 27014
Leu Leu Phe Ser Lys Pro Leu Tyr Leu Gln Asn Asp Phe Leu Ala
1945 1950 1955

tta gcc att gaa cgc ccc ctg gct gta gca gcc gca ggt act ctg 27059
Leu Ala Ile Glu Arg Pro Leu Ala Val Ala Ala Ala Gly Thr Leu
1960 1965 1970

acc cta caa ctt act cct cca tta aag act aac gat gac ggg cta 27104
Thr Leu Gln Leu Thr Pro Pro Leu Lys Thr Asn Asp Asp Gly Leu
1975 1980 1985

aca cta tcc aca gtc gag cca tta act gta aaa aac gga aac cta 27149
Thr Leu Ser Thr Val Glu Pro Leu Thr Val Lys Asn Gly Asn Leu
1990 1995 2000

ggc ttg caa ata tcg cgc cct tta gtt gtt caa aac aac ggc ctt 27194
Gly Leu Gln Ile Ser Arg Pro Leu Val Val Gln Asn Asn Gly Leu
2005 2010 2015

tcg ctt gct att acc ccc ccg ctg cgt ttg ttt aac agc gac ccc 27239
Ser Leu Ala Ile Thr Pro Pro Leu Arg Leu Phe Asn Ser Asp Pro
2020 2025 2030

gtt ctt ggt ttg ggc ttc act ttt ccc cta gct gtc aca aac aac 27284
Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala Val Thr Asn Asn
2035 2040 2045

ctc ctc tcc tta aac atg gga gac gga gtt aaa ctt acc tat aat 27329
Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu Thr Tyr Asn
2050 2055 2060

aaa cta aca gcc aat ttg ggt agg gat tta caa ttt gaa aac ggt 27374
Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gln Phe Glu Asn Gly
2065 2070 2075

gcg att gcc gta acg ctt act gcc gaa tta cct ttg caa tac act 27419
Ala Ile Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gln Tyr Thr
2080 2085 2090

aac aaa ctt caa ctg aat att gga gct ggc ctt cgt tac aat gga 27464
Asn Lys Leu Gln Leu Asn Ile Gly Ala Gly Leu Arg Tyr Asn Gly
2095 2100 2105

gcc agc aga aaa cta gat gta aac att aac caa aat aaa ggc tta 27509
Ala Ser Arg Lys Leu Asp Val Asn Ile Asn Gln Asn Lys Gly Leu
2110 2115 2120

act tgg gac aac gat gca gtt att ccc aaa cta gga tcg ggc tta 27554
Thr Trp Asp Asn Asp Ala Val Ile Pro Lys Leu Gly Ser Gly Leu
2125 2130 2135

caa ttt gac cct aat ggc aac atc gct gtt atc cct gaa acc gtg 27599
Gln Phe Asp Pro Asn Gly Asn Ile Ala Val Ile Pro Glu Thr Val
2140 2145 2150

aag ccg caa acg tta tgg acg act gca gat ccc tcg cct aac tgc 27644
Lys Pro Gln Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys
2155 2160 2165

tca gtg tac cag gac ttg gat gcc agg ctg tgg ctc gct ctt gtt 27689
Ser Val Tyr Gln Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val
2170 2175 2180

aaa agt ggc gac atg gtg cat gga agc att gcc cta aaa gcc cta 27734
Lys Ser Gly Asp Met Val His Gly Ser Ile Ala Leu Lys Ala Leu
2185 2190 2195

aaa ggg acg ttg cta aat cct aca gcc agc tac att tcc att gtg 27779
Lys Gly Thr Leu Leu Asn Pro Thr Ala Ser Tyr Ile Ser Ile Val
2200 2205 2210

ata tat ttt tac agc aac gga gtc agg cgt acc aac tat cca acg 27824
Ile Tyr Phe Tyr Ser Asn Gly Val Arg Arg Thr Asn Tyr Pro Thr
2215 2220 2225

ttt gac aac gaa ggc acc tta gct aac agc gcc act tgg gga tac 27869
Phe Asp Asn Glu Gly Thr Leu Ala Asn Ser Ala Thr Trp Gly Tyr
2230 2235 2240

cga cag ggg caa tct gct aac act aat gtg acc aat gcc act gaa 27914
Arg Gln Gly Gln Ser Ala Asn Thr Asn Val Thr Asn Ala Thr Glu
2245 2250 2255

ttt atg ccc agc tca agc agg tac ccc gtg aat aaa gga gac aac 27959
Phe Met Pro Ser Ser Ser Arg Tyr Pro Val Asn Lys Gly Asp Asn
2260 2265 2270

att caa aat caa tct ttt tca tac acc tgt att aaa gga gat ttt 28004
Ile Gln Asn Gln Ser Phe Ser Tyr Thr Cys Ile Lys Gly Asp Phe
2275 2280 2285

gct atg cct gtc ccg ttc cgt gta aca tat aat cac gcc ctg gaa 28049
Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His Ala Leu Glu
2290 2295 2300

ggg tat tcc ctt aag ttc acc tgg cgc gtt gta gcc aat cag gcc 28094
Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn Gln Ala
2305 2310 2315

ttt gat att cct tgc tgt tca ttt tca tac atc aca gaa taa 28136
Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu
2320 2325 2330

tttcatttta atttcttttt attttacacg aacagtgaga cttcctccac 28196

accagcctct cccccttcat agcagtaaac tgttgtgaat 28256

aaaatccaaa cagtctcttt ggtgatgaaa cgtcgatcag 28316

ccaacgtttc ggtgaaaaac tgcacaccgc 28376

aaacaggttc aggctctcca cgggttatct ccccgatcaa actcagacag 28436


crcrtaaaocrtcf cacrtcrcrtcftt ccactaaacc acgcaggtgg cgctgtctga acctctcggt 28496

gcgactcctg taagqctgcft aagaagttag attgtccagt agcctcacag catgtatcat 28556

cagtctacga atgcgtctgg cgcagcagcg catctgaatc tcactgagat tccggcaaga 28616

atcgcacacc atcacaatca ggttgttcat gatcccatag ctgaacacgc tccagccaaa 28676

gctcattcgc tccaacagcg ccaccgcgtg tccgtccaac CT-T-aCT_.t.L.clci a Laaa L. Oct y 28736

gtgtctgccg cgtacaaaca tgctacccac atacagaact tcccggggca ggcccctgtt 28796

caccacctgt ctgtaccagg gaaacctcac atttatcagg gagccataga tggccatttt 28856

aaaccaatta gctaataccg ccccaccagc tctacactga agagaaccgg gagagttaca 28916

atgacagtga ataatccatc tctcataacc cctgatggtc tgatgaaaat ctagatctaa 28976

cgtggcacaa caaatacaca ctttcatata cattttcata acatgttttt cccaggccgt 29036

taaaatacaa tcccaataca cgggccactc ctgcagtaca ataaagctaa tacaagatgg 29096

tatactcctc acctcactga cactgtgcat gttcatattt tcacattcta agtaccgaga 29156

gttctcctct acagcagcac tgctgcggtc ctcacaaggt ggtagctggt gatgattgta 29216

gggggccagt ctgcagcgat accgtctgtc gcgttgcatc gtagaccagg aaccgacgca 29276

cctcctcgta cttgtggtag cagaaccacg tccgctgcca gcacgtctcc acgtaacgcc 29336

ggtccctgcg tcgctcacgc tccctcctca atgcaaagtg caaccactct tgtaatccac 29396

acagatccct ctcggcctcc ggggtgatgc acacctcaaa cctacagatg tctcggtaca 29456


gttccaaaca cgtagtgagg gcgagttcca accaagacag acagcctgat ctatcccgac 29516

acactggagg tggaggaaga cacggaagag gcatgttatt ccaagcgatt caccaacggg 29576

tcgaaatgaa gatcccgaag atgacaacgg tcgcctccgg agccctgatg gaatttaaca 29636

gccagatcaa acgttatgcg attctccaag ctatcgatcg ccgcttccaa aagagcctgg 29696

acccgcactt ccacaaacac cagcaaagca aaagcactat tatcaaactc ttcaatcatc 29756

aagctgcagg actgtacaat gcctaagtaa ttttcgtttc tccactcgcg aatgatgtcg 29816

cggcagatag tctgaaggtt catcccgtgc agggtaaaaa gctccgaaag ggcgccctct 29876

acagccatgc gtagacacac catcatgact gcaagatatc gggctcctga gacacctgca 29936

gcagatttaa cagatcaagg tcaggttgct ctccgcgatc acgaatctcc atccgcaagg 29996

tcatttgcaa aaaattaaat aaatctatgc cgactagatc tgtcaactcc gcattaggaa 30056

ccaaatcagg tgtggctacg cagcacaaaa gttccaggga tggtgccaaa ctcactagaa 30116

ccgctcccga gtaacaaaac tgatgaatgg gagtaacaca gtgtaaaatg tgcaaccaaa 30176

aatcactaag gtgctccttt aaaaagtcca gtacttctat attcagtccg tgcaagtact 30236

gaagcaactg tgcgggaata tgcacaacaa aaaaaatagg gcggctcaga tacatgttga 30296

cctaaaataa aaagaatcat taaactaaag aagcttggcg aacggtggga taaatgacac 30356

gctccagcag cagacaggca accggctgtc cccgggaacc gcggtaaaat tcatccgaat 30416

gattaaaaag aacaacagaa acttcccacc atgtactcgg ttggatctcc tgagcacaca 30476

gcaatacccc cctcacattc atgtccgcca cagaaaaaaa acgtcccaga tacccagcgg 30536

ggatatccaa cgacagctgc aaagacagca aaacaatccc tctgggagcg atcacaaaat 30596

cctccggtga aaaaagcaca tacatattag aataaccctg ttgctggggc aaaaaggccc 30656

ggcgtcccag caaatgcaca taaatatgtt catcagccat tgccccgtct taccgcgtaa 30716

tcagccacga aaaaatcgag ctaaaattca cccaacagcc tatagctata tatacactcc 30776

gcccaatgac gctaataccg caccacccac gaccaaagtt cacccacacc cacaaaaccc 30836

gcgaaaatcc agcgccgtca gcacttccgc aatttcagtc tcacaacgtc acttccgcgc 30896

gccttttcac attcccacac acacccgcgc ccttcgcccc gccctcgcgc caccccgcgt 30956

caccgcacgt caccccggcc ccgcctcgct cctccccgct cattatcata ttggcacgtt 31016

tccagaataa ggtatattat tgatgatg 31044

<210> 30
<211> 505
<212> PRT
<213> simian adenovirus SV-25

<400> 30

Met Arg Arg Ala Val Arg Val Thr Pro Ala Ala Tyr Glu Gly Pro Pro
1 5 10 15

Pro Ser Tyr Glu Ser Val Met Gly Ser Ala Asn Val Pro Ala Thr Leu
20 25 30

Glu Ala Pro Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly Arg
35 40 45

Asn Ser Ile Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr Lys
50 55 60

Val Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Ser Leu Asn Tyr
65 70 75 80

Gln Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gln Asn Asn Asp
85 90 95

Phe Thr Pro Thr Glu Ala Gly Thr Gln Thr Ile Asn Phe Asp Glu Arg
100 105 110

Ser Arg Trp Gly Gly Gln Leu Lys Thr Ile Leu His Thr Asn Met Pro
115 120 125

Asn Ile Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Lys Leu Met
130 135 140

Val Glu Lys Ser Asn Ala Glu Thr Arg Gln Pro Arg Tyr Glu Trp Phe
145 150 155 160

Glu Phe Thr Ile Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr Ile Asp
165 170 175

Leu Met Asn Asn Ala Ile Val Asp Asn Tyr Leu Gln Val Gly Arg Gln
180 185 190

Asn Gly Val Leu Glu Ser Asp Ile Gly Val Lys Phe Asp Thr Arg Asn
195 200 205

Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly Val
210 215 220

Tyr Thr Asn Glu Ala Phe His Pro Asp Ile Val Leu Leu Pro Gly Cys
225 230 235 240

Gly Val Asp Phe Thr Gln Ser Arg Leu Ser Asn Leu Leu Gly Ile Arg
245 250 255

Lys Arg Arg Pro Phe Gln Glu Gly Phe Gln Ile Met Tyr Glu Asp Leu
260 265 270

Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Ser Lys Tyr Glu Ala
275 280 285

Ser Ile Gln Arg Ala Lys Ala Glu Gly Arg Glu Ile Arg Gly Asp Thr
290 295 300

Phe Ala Val Ala Pro Gln Asp Leu Glu Ile Val Pro Leu Thr Lys Asp
305 310 315 320

Ser Lys Asp Arg Ser Tyr Asn Ile Ile Asn Asn Thr Thr Asp Thr Leu
325 330 335

Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly
340 345 350

Val Arg Ser Trp Thr Ile Leu Thr Thr Thr Asp Val Thr Cys Gly Ser
355 360 365

Gln Gln Val Tyr Trp Ser Leu Pro Asp Met Met Gln Asp Pro Val Thr
370 375 380

Phe Arg Pro Ser Thr Gln Val Ser Asn Phe Pro Val Val Gly Thr Glu
385 390 395 400

Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr
405 410 415

Ser Gln Leu Ile Arg Gln Ser Thr Ala Leu Thr His Val Phe Asn Arg
420 425 430

Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr
435 440 445

Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro
450 455 460

Leu Arg Ser Ser Ile Ser Gly Val Gln Arg Val Thr Ile Thr Asp Ala
465 470 475 480

Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Val Val Ala
485 490 495

Pro Lys Val Leu Ser Ser Arg Thr Phe
500 505

<210> 31 
<211> 921 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 31 

Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala
1 5 10 15

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala
20 25 30

Arg Ala Thr Asp Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro
35 40 45

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu
50 55 60

Thr Leu Arg Phe Val Pro Val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr
65 70 75 80

Lys Val Arg Tyr Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met
85 90 95

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Leu Asp Arg Gly Pro Ser
100 105 110

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly
115 120 125

Ala Pro Asn Pro Ser Glu Trp Thr Asp Thr Ser Asp Asn Lys Leu Lys
130 135 140

Ala Tyr Ala Gln Ala Pro Tyr Gln Ser Gln Gly Leu Thr Lys Asp Gly
145 150 155 160

Ile Gln Val Gly Leu Val Val Thr Glu Ser Gly Gln Thr Pro Gln Tyr
165 170 175

Ala Asn Lys Val Tyr Gln Pro Glu Pro Gln Ile Gly Glu Asn Gln Trp
180 185 190

Asn Leu Glu Gln Glu Asp Lys Ala Ala Gly Arg Val Leu Lys Lys Asp
195 200 205

Thr Pro Met Phe Pro Cys Tyr Gly Ser Tyr Ala Arg Pro Thr Asn Glu
210 215 220

Gln Gly Gly Gln Ala Lys Asn Gln Glu Val Asp Leu Gln Phe Phe Ala
225 230 235 240

Thr Pro Gly Asp Thr Gln Asn Thr Ala Lys Val Val Leu Tyr Ala Glu
245 250 255

Asn Val Asn Leu Glu Thr Pro Asp Thr His Leu Val Phe Lys Pro Asp
260 265 270

Asp Asp Ser Thr Ser Ser Lys Leu Leu Leu Gly Gln Gln Ala Ala Pro
275 280 285

Asn Arg Pro Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met
290 295 300

Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser
305 310 315 320

Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser
325 330 335

Tyr Gln Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser
340 345 350

Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile
355 360 365

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu
370 375 380

Gly Gly Met Val Val Thr Asp Asn Tyr Asn Ser Val Thr Pro Gln Asn
385 390 395 400

Gly Gly Ser Gly Asn Thr Trp Gln Ala Asp Asn Thr Thr Phe Ser Gln
405 410 415

Arg Gly Ala Gln Ile Gly Ser Gly Asn Met Phe Ala Leu Glu Ile Asn
420 425 430

Leu Gln Ala Asn Leu Trp Arg Gly Phe Leu Tyr Ser Asn Ile Gly Leu
435 440 445

Tyr Leu Pro Asp Ser Leu Lys Ile Thr Pro Asp Asn Ile Thr Leu Pro
450 455 460

Glu Asn Lys Asn Thr Tyr Gln Tyr Met Asn Gly Arg Val Thr Pro Pro
465 470 475 480

Gly Leu Ile Asp Thr Tyr Val Asn Val Gly Ala Arg Trp Ser Pro Asp
485 490 495

Val Met Asp Ser Ile Asn Pro Phe Asn His His Arg Asn Ala Gly Leu
500 505 510

Arg Tyr Arg Ser Met Leu Leu Gly Asn Gly Arg Tyr Val Pro Phe His
515 520 525

Ile Gln Val Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu
530 535 540

Pro Gly Ser Tyr Thr Tyr Glu Trp Asn Phe Arg Lys Asp Val Asn Met
545 550 555 560

Ile Leu Gln Ser Ser Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser
565 570 575

Ile Arg Phe Asp Ser Ile Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala
580 585 590

His Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn
595 600 605






Asp Gln Ser Phe Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro
610 615 620

Ile Pro Ala Asn Ala Thr Ser Val Pro Ile Ser Ile Pro Ser Arg Asn
625 630 635 640

Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Thr Lys Glu
645 650 655

Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly
660 665 670

Ser Ile Pro Tyr Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys
675 680 685

Lys Val Ser Ile Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp
690 695 700

Arg Leu Leu Thr Pro Asn Glu Phe Glu Ile Lys Arg Ser Val Asp Gly
705 710 715 720

Glu Gly Tyr Asn Val Ala Gln Ser Asn Met Thr Lys Asp Trp Phe Leu
725 730 735

Ile Gln Met Leu Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val
740 745 750

Pro Glu Asn Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln
755 760 765

Pro Met Ser Arg Gln Val Val Asp Thr Val Thr Tyr Thr Asp Tyr Lys
770 775 780

Asp Val Lys Leu Pro Tyr Gln His Asn Asn Ser Gly Phe Val Gly Tyr
785 790 795 800

Met Gly Pro Thr Met Arg Glu Gly Gln Ala Tyr Pro Ala Asn Tyr Pro
805 810 815

Tyr Pro Leu Ile Gly Glu Thr Ala Val Pro Ser Leu Thr Gln Lys Lys
820 825 830

Phe Leu Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe
835 840 845

Met Ser Met Gly Ser Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala
850 855 860

Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asp
865 870 875 880

Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu Val Phe Asp Val Val Arg
885 890 895

Ile His Gln Pro His Arg Gly Val Ile Glu Ala Val Tyr Leu Arg Thr
900 905 910

Pro Phe Ser Ala Gly Asn Ala Thr Thr
915 920



<210> 32 
<211> 347 
<212> PRT 

<213> simian adenovirus SV-25 
<400> 32 

Met Lys Ile Cys Val Val Ile Phe Ala Leu Ser Leu Ile Lys Thr Glu
1 5 10 15

Leu Phe Ala Val Pro Ser Thr Pro Arg Val Val Ser Pro Cys Glu Lys
20 25 30

Thr Pro Gly Val Leu Asn Leu His Ile Ala Lys Pro Leu Tyr Phe Thr
35 40 45

Ile Glu Lys Gln Leu Ala Leu Ser Ile Gly Lys Gly Leu Thr Ile Ser
50 55 60

Ala Thr Gly Gln Leu Glu Ser Thr Ala Ser Val Gln Asp Ser Ala Thr
65 70 75 80

Pro Pro Leu Arg Gly Ile Ser Pro Leu Lys Leu Thr Asp Asn Gly Leu
85 90 95

Thr Leu Ser Tyr Ser Asp Pro Leu Arg Val Val Gly Asp Gln Leu Thr
100 105 110

Phe Asn Phe Thr Ser Pro Leu Arg Tyr Glu Asn Gly Ser Leu Thr Phe
115 120 125

Asn Tyr Thr Ser Pro Met Thr Leu Ile Asn Asn Ser Leu Ala Ile Asn
130 135 140

Val Asn Thr Ser Lys Gly Leu Ser Ser Asp Asn Gly Thr Leu Ala Val
145 150 155 160

Asn Val Thr Pro Asp Phe Arg Phe Asn Ser Ser Gly Ala Leu Thr Phe
165 170 175

Gly Ile Gln Ser Leu Trp Thr Phe Pro Thr Lys Thr Pro Asn Cys Thr
180 185 190

Val Phe Thr Glu Ser Asp Ser Leu Leu Ser Leu Cys Leu Thr Lys Cys
195 200 205

Gly Ala His Val Leu Gly Ser Val Ser Leu Ser Gly Val Ala Gly Thr
210 215 220

Met Leu Lys Met Thr His Thr Ser Val Thr Val Gln Phe Ser Phe Asp
225 230 235 240

Asp Ser Gly Lys Leu Ile Phe Ser Pro Leu Ala Asn Asn Thr Trp Gly
245 250 255

Val Arg Gln Ser Glu Ser Pro Leu Pro Asn Pro Ser Phe Asn Ala Leu
260 265 270

Thr Phe Met Pro Asn Ser Thr Ile Tyr Ser Arg Gly Ala Ser Asn Glu
275 280 285

Pro Gln Asn Asn Tyr Tyr Val Gln Thr Tyr Leu Arg Gly Asn Val Arg
290 295 300

Lys Pro Ile Leu Leu Thr Val Thr Tyr Asn Ser Val Asn Ser Gly Tyr
305 310 315 320

Ser Leu Thr Phe Lys Trp Asp Ala Val Ala Asn Glu Lys Phe Ala Thr
325 330 335

Pro Thr Ser Ser Phe Cys Tyr Val Ala Glu Gln
340 345



<210> 33 

<211> 559 

<212> PRT 

<213> simian adenovirus SV-25 



<400> 33 

Met Lys Arg Ala Arg Val Asp Glu Asp Phe Asn Pro Val Tyr Pro Tyr
1 5 10 15

Asp Pro Pro Tyr Ala Pro Val Met Pro Phe Ile Thr Pro Pro Phe Thr
20 25 30

Ser Ser Asp Gly Leu Gln Glu Lys Pro Leu Gly Val Leu Ser Leu Asn
35 40 45

Tyr Arg Asp Pro Ile Thr Thr Gln Asn Gly Ser Leu Thr Leu Lys Leu
50 55 60

Gly Asn Gly Leu Thr Leu Asn Asn Gln Gly Gln Leu Thr Ser Thr Ala
65 70 75 80

Gly Glu Val Glu Pro Pro Leu Thr Asn Ala Asn Asn Lys Leu Ala Leu
85 90 95



Ala Tyr Ser Glu Pro Leu Ala Val Lys Ser Asn Arg Leu Thr Leu Ser
100 105 110

His Thr Ala Pro Leu Val Ile Ala Asn Asn Ser Leu Ala Leu Gln Val
115 120 125

Ser Glu Pro Ile Phe Val Asn Asp Asp Asp Lys Leu Ala Leu Gln Thr
130 135 140

Ala Ala Pro Leu Val Thr Asn Ala Gly Thr Leu Arg Leu Gln Ser Ala
145 150 155 160

Ala Pro Leu Gly Leu Val Glu Asn Thr Leu Lys Leu Leu Phe Ser Lys
165 170 175

Pro Leu Tyr Leu Gln Asn Asp Phe Leu Ala Leu Ala Ile Glu Arg Pro
180 185 190

Leu Ala Val Ala Ala Ala Gly Thr Leu Thr Leu Gln Leu Thr Pro Pro
195 200 205

Leu Lys Thr Asn Asp Asp Gly Leu Thr Leu Ser Thr Val Glu Pro Leu
210 215 220

Thr Val Lys Asn Gly Asn Leu Gly Leu Gln Ile Ser Arg Pro Leu Val
225 230 235 240

Val Gln Asn Asn Gly Leu Ser Leu Ala Ile Thr Pro Pro Leu Arg Leu
245 250 255

Phe Asn Ser Asp Pro Val Leu Gly Leu Gly Phe Thr Phe Pro Leu Ala
260 265 270

Val Thr Asn Asn Leu Leu Ser Leu Asn Met Gly Asp Gly Val Lys Leu
275 280 285

Thr Tyr Asn Lys Leu Thr Ala Asn Leu Gly Arg Asp Leu Gln Phe Glu
290 295 300

Asn Gly Ala Ile Ala Val Thr Leu Thr Ala Glu Leu Pro Leu Gln Tyr
305 310 315 320

Thr Asn Lys Leu Gln Leu Asn Ile Gly Ala Gly Leu Arg Tyr Asn Gly
325 330 335

Ala Ser Arg Lys Leu Asp Val Asn Ile Asn Gln Asn Lys Gly Leu Thr
340 345 350

Trp Asp Asn Asp Ala Val Ile Pro Lys Leu Gly Ser Gly Leu Gln Phe
355 360 365

Asp Pro Asn Gly Asn Ile Ala Val Ile Pro Glu Thr Val Lys Pro Gln
370 375 380

Thr Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Cys Ser Val Tyr Gln
385 390 395 400

Asp Leu Asp Ala Arg Leu Trp Leu Ala Leu Val Lys Ser Gly Asp Met
405 410 415

Val His Gly Ser Ile Ala Leu Lys Ala Leu Lys Gly Thr Leu Leu Asn
420 425 430

Pro Thr Ala Ser Tyr Ile Ser Ile Val Ile Tyr Phe Tyr Ser Asn Gly
435 440 445

Val Arg Arg Thr Asn Tyr Pro Thr Phe Asp Asn Glu Gly Thr Leu Ala
450 455 460

Asn Ser Ala Thr Trp Gly Tyr Arg Gln Gly Gln Ser Ala Asn Thr Asn
465 470 475 480

Val Thr Asn Ala Thr Glu Phe Met Pro Ser Ser Ser Arg Tyr Pro Val
485 490 495

Asn Lys Gly Asp Asn Ile Gln Asn Gln Ser Phe Ser Tyr Thr Cys Ile
500 505 510

Lys Gly Asp Phe Ala Met Pro Val Pro Phe Arg Val Thr Tyr Asn His
515 520 525

Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg Val Val Ala Asn
530 535 540

Gln Ala Phe Asp Ile Pro Cys Cys Ser Phe Ser Tyr Ile Thr Glu
545 550 555

<210> 34
<211> 34115
<212> DNA
<213> simian adenovirus SV-39

<220>
<221> CDS
<222> (13448)..(14959)
<223> L2 Penton

<220>
<221> CDS
<222> (17785)..(20538)
<223> L3 Hexon

<220>
<221> CDS
<222> (29515)..(31116)
<223> L5 Fiber #1

<400> 34

catcatcaat ataacaccgc aagatggcga ccgagttaac atgcaaatga ggtgggcgga 60

gttacgcgac ctttgtcttg ggaacgcgga agtgggcgcg gcgggtttcg gggaggagcg 120

cggggcgggg cgggcgtgtc gcgcggcggt gacgcgccgg ggacccggaa attgagtagt 180

ttttattcat tttgcaagtt tttctgtaca ttttggcgcg aaaactgaaa cgaggaagtg 240

aaaagtgaaa aatgccgagg tagtcaccgg gtggagatct gacctttgcc gtgtggagtt 300

tacccgctga cgtgtgggtt tcggtctcta ttttttcact gtggttttcc gggtacggtc 360

aaacratcccc

CJ. CT. Ol y y ^ ^ 


attttatgac 


tccacgtcag 


ctgategcta 


gggtatttaa 


tgcgcctcag 


420 


Qi i y uuaay ci 


rjrfccactctt 


aaatcrcccrcf c 


aaaaacfacrtt 


ttctcctccg 


cgttccgcca 


480 


—j i~»4- /-t+~ rr» a 33 
aULLJ Ly aaaa 


^s^hrfarrrfaar' 
ct gl o y cl y y a a 1 — • 


ttcttactat 


ct cegggget 


gccagcgacc 


gtagccgccg 


540 


— i /—I 4— j-v4— 4— a 

agcug Lugya 


y y auct u Ly ll 


3r*r , rfrran*r , 'r~r'' 

ci ' ^-*y y ci y « 


tgggagacga 


t cct caggtg 


atttctcact 


600 


4—4-4— i^r 4- /-n - ^ a <t a 

l. l l y uy day a. 


4-4-4-4- rr 4- (-4-4- 
L L L Lay LULL 


p\ ~\- rf ^ f" fl - r*t" 

* — ■ ci l- y a i— w i— 


ataatattaa 


tccgggtgtt 


gaggggcaag 


660 


3 rras t" rra a i~ rf 
a. y y o. y a. ci y 


act crcracrt ct 


atacrataacyt 


tttttccgga 


cgctatgctg 


ctagaggctg 


720 


ex i_. y i_*a 


acct cacaac 


tctcacactg 


agcccgagtc 


agetgetatt 


cctgaattgt 


780 


r*3 "h rrrr +* rra 


a ft* tcract t Cf 


gcttgttacg 


agactatgee 


teegqagteg 


gatgaggagg 


840 


anarrr'rrrfrra't" 

cj. * — ■ ci y v-* y y y ct. 


cagcgatccc 


aeggctttta 


tggtctctaa 


ggegattget 


atactaaaag 


900 


aay a Ly ct. uy a 


trfrrr*rTatcfai 

\— y y y a w y c* i— 


ggattt cgac 


tggacgctcc 


ggcggtgccg 


gggagagact 


960 


>-r 4— ^3 /*>r4— /— « 4— /~f 

gtaay tccty 


L y Ct Ct LaLLaL 


r , rfrfrr?}"l - r , rri*3 

yyy y 


cccrrracraccc

ctgagtagtc tgtgcgcttt ttgtgcttgg ggttaggcag ggagtaggtg acgtcgttaa 7260

agaggatttt gccacatctg ggcataaagt tgcgagagat tctgaagggg ccgggcacct 7320 

ccgagcggtt gttgatgact tgggcagcca ggagaatttc gtcgaagccg ttgatgttgt 7380 

gccccacgac gtagaactct atgaaacgcg gagcgccgcg cagcaggggg cacttttcaa 7440 

gttgctggaa agtaagttcc cgcggctcga cgccgtgttc cgtgcggctc cagtcctcca 7500 

ccgggtttcg ctccacaaaa tcctgccaga tgtggtcgac tagcaagagc tgcagtcggt 7560 

cgcgaaattc gcggaatttt ctgccgatgg cttgcttctg ggggttcaag caaaaaaagg 7620 

tgtctgcgtg gtcgcgccag gcgtcccagc cgagctcgcg agccagattc agggccagca 7680 

gcaccagagc cggctcaccg gtgattttca tgacgaggag aaagggcacc agctgttttc 7740 

cgaacgcgcc catccaggtg taggtctcca cgtcgtaggt gagaaacaga cgttcggtcc 7800 

gcgggtgcga tcccaggggg aaaaacttga tgggctgcca ccattgggag ctctgggcgt 7860 

ggatgtgatg gaagtaaaag tcccggcggc gcgtggaaca ttcgtgctgg tttttgtaaa 7920 

agcggccgca gtggtcgcag cgcgagacgg agtgaaggct gtgaatcagg tgaatcttgc 7980 

gtcgctgagg gggccccaga gccaaaaagc ggagcgggaa cgaccgcgcg gccacttcgg 8040 

cgtccgcagg caagatggat gagggttcca ccgttccccg cccgcggacc gaccagactt 8100 

ccgccagctg cggcttcagt tcttgcacca gctctcgcag cgtttcgtcg ctgggcgaat 8160 

cgtgaatacg gaagttgtcg ggtagaggcg ggaggcggtg gacttccagg aggtgtgtga 8220 

gggccggcag gagatgcagg tggtacttga tttcccacgg atgacggtcg cgggcgtcca 8280 

aggcgaagag atgaccgtgg ggccgcggcg ccaccagcgt tccgcggggg gtctttatcg 8340 

gcggcgggga cgggctcccg gcggcagcgg cggctcggga cccgcgggca agtcgggcag 8400 

cggcacgtcg gcgtggagct cgggcagggg ctggtgctgc gcgcggagct gactggcaaa 8460 

ggctatcacc cggcgattga cgtcctggat ccggcggcgc tgcgtgaaga ccaccggacc 8520 

cgtggtcttg aacctgaaag agagttcgac agaatcaatc tcggcatcgt taaccgcggc 8580 

ctggcgcagg atttcggcca cgtccccgga gttgtcttga tacgcgattt ctgccatgaa 8640

ctggtcgatt tcctcttcct gcaagtctcc gtgaccggcg cgttcgacgg tggccgcgag 8700 

atcgttggag atgcggccca tgagctggga aaaggcattg atgccgacct cgttccacac 8760

tcggctgtac accacctctc cgtgaacgtc gcgggcgcgc atcaccacct gggcgagatt 8820 

gagttccacg tggcgggcga aaaccggata gtttcggagg cgctgataca gatagttgag 8880 

ggtggtggcg gcgtgctcgg ccacaaaaaa atacatgatc cagcggcgga gggtcagctc 8940

gctgggtggc 10080

gggggcgttg ggggcgagat cttccaacat gaggcggtga tagccgtaga tgtagcgcga 10140

catccaagtg atgccgctgg ccgtggtgct ggcgcgggcg tagtcgcgaa cgcggttcca 10200

gatgtttcgc agcggctgga agtactcgat ggtggggcga ctctgccccg tgaggcgggc 10260

gcagtcggcg atgctctacg gggaaaaaga agggccagtg aacaaccgcc ttccgtagcc 10320

ggaggagaac gcaagggggt caaagaccac cgaggctcgg gttcgaaacc cgggtggcgg 10380

cccgaatacg gagggcggtt ttttgctttt ttctcagatg catcccgtgc tgcggcagat 10440

gcgtccgaac gcggggtccc agtccccggc ggtgcctgcg gccgtgacgg cggcttctac 10500

ggccacgtcg cgctccaccc cgcctaccac ggcccaggcg gcggtggctc tgcgcggcgc 10560

aggggaaccc gaagcagagg cggtgttgga cgtggaggag ggccaggggt tggctcggct 10620

gggggccctg agtcccgagc ggcacccgcg cgtggctctg aagcgcgacg cggcggaggc 10680

gtacgtgccg cggagcaatc tgtttcgcga ccgcagcggc gaggaggccg aggagatgcg 10740 

agacttgcgt tttcgggcgg ggagggagtt gcgtcacggg ctggaccggc agagggttct 10800 

gagagaggag gactttgagg cggacgagcg cacgggggtg agtcccgcgc gggctcacgt 10860

ggcggccgcc aacctggtga gcgcgtacga gcagacggtc aaggaggaga tgaacttcca 10920 

gaagagcttc aatcatcacg tgcgcacgct gattgcgcgc gaagaggtgg ccatcggcct 10980 

catgcatctg tgggattttg tggaggcgta cgttcagaac cccagcagca agccgctgac 11040 

ggctcagctg ttcctcatcg tgcaacatag tcgagacaac gaaacgttca gggaggccat 11100 

gctgaacatt gcagagcctg aggggcgctg gctcttggat ctcattaaca tcttgcagag 11160 

tatcgtagtg caggagcgct cgctgagcct ggccgacaag gtggctgcca tcaactacag 11220 

catgctgtcg ctgggcaaat tttacgcccg caagatctac aagtctccgt tcgtccccat 11280 

agacaaggag gtgaagatag acagctttta catgcgcatg gcgctcaagg tgctgactct 11340 

aagcgacgac ctgggggtgt accgcaacga ccgcatacac aaggcggtga gcgccagccg 11400 

ccggcgcgag ctgagcgacc gcgagctttt gcacagcctg catcgggcgt tgactggtgc 11460 

cggcagcgcc gaggcggccg agtactttga cgccggagcg gacttgcgct ggcagccatc 11520 

ccgacgcgcg ctggaggcgg ctggcgtcgg ggagtacggg gtcgaggacg acgatgaagc 11580 

ggacgacgag ttgggcattg acttgtagcc gtttttcgtt agatatgtcg gcgaacgagc 11640 

cgtctgcggc cgccatggtg acggcggcgg gcgcgcccca ggacccggcc acgcgcgcgg 11700 

cgctgcagag tcagccttcc ggagtgacgc ccgcggacga ctggtccgag gccatgcgtc 11760 

gcatcctggc gctgacggcg cgcaaccccg aggcttttcg gcagcagccg caggcaaacc 11820 

ggtttgcggc cattttggaa gcggtggtgc cctccagacc caaccccacc cacgaaaagg 11880 

tgctggccat cgtcaacgcc ctggcggaga ccaaggccat ccgcccagac gaggccgggc 11940 

aggtttacaa cgcgctgcta gaaagggtgg gacgctacaa cagctccaac gtgcagacca 12000 

atctggaccg cttggtgacg gacgtgaagg aggccgtagc ccagcgagag cggtttttca 12060 

aggaagccaa tctgggctcg ctggtggccc tcaacgcctt cctgagcacg ctgccggcga 12120 

acgtgccccg cggtcaggag gactacgtga actttctgag cgccctccgc ctgatggtgg 12180

ccgaggtgcc gcagagcgag gtgtaccagt ctggccccaa ctactacttc cagacctccc 12240 

ggcagggcct gcagacggta aacctgacgc aggcctttca gaacctgcag ggcctttggg 12300 

gggtgcgcgc tccgctgggc gaccgcagca cggtgtccag cctgctgacc cccaatgccc 12360

ggctgctctt gcttctcatt gctccgttca ccgacagcgg ttccatcagc cgcgactctt 12420

acctgggaca cctgctcacc ctgtaccggg aggccatcgg gcaggcgcgg gtggacgagc 12480

agacgtacca ggaaatcacc agcgtgagcc gcgcgctggg gcaggaggac acgggcagct 12540

tggaggcgac tctgaacttc ctgctgacca accggcggca gcgcctacct ccccagtacg 12600

cgctgaacgc ggaggaggag cgcatcctgc gtttcgtgca gcagagcacc gcgctgtact 12660

tgatgcggga aggcgcctct cccagcgctt cgctggacat gacggcggcc aacatggagc 12720

catcgttcta cgccgccaac cgtcccttcg tcaaccggct aatggactat ttgcatcggg 12780

cggcggccct gaacccggaa tactttacta acgtcatcct gaacgaccgt tggctgccac 12840

ctcccggctt ctacacgggg gagttcgacc tcccggaggc caacgacggt ttcatgtggg 12900

acgacgtgga cagcgtgttc ctgcccggca agaaggaggc gggtgactct cagagccacc 12960

gcgcgagcct cgcagacctg ggggcgaccg ggcccgcgtc tccgctgcct cgcctgccga 13020

gcgccagcag cgccagcgtg gggcgggtga gccgtccgcg cctcagcggt gaggaggact 13080

ggtggaacga tccgctgctc cgtccggccc gcaacaaaaa cttccccaac aacgggatag 13140

aggatttggt agacaaaatg aaccgttgga agacgtatgc ccaggagcat cgggagtggc 13200

aggcgaggca acccatgggc cctgttctgc cgccctctcg gcgcccgcgc agggacgaag 13260

acgccgacga ttcagccgat gacagcagcg tgttggatct gggcgggagc gggaacccct 13320

ttgcccacct gcaacctcgc ggcgtgggtc ggcggtggcg ctaggaaaaa aaattattaa 13380

aagcacttac cagagccatg gtaagaagag caacaaaggt gtgtcctgct ttcttcccgg 13440


tagcaaa atg cgt cgg gcg gtg gca gtt ccc tcc gcg gca atg gcg tta 13489 
Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu 
1 5 10 

ggc ccg ccc cct tct tac gaa agc gtg atg gca gcg gcc acc ctg caa 13537 
Gly Pro Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gln 
15 20 25 30 

gcg ccg ttg gag aat cct tac gtg ccg ccg cga tac ctg gag cct acg 13585 
Ala Pro Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr 
35 40 45 

ggc ggg aga aac agc att cgt tac tcg gag ctg acg ccc ctg tac gac 13633 
Gly Gly Arg Asn Ser Ile Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp 
50 55 60 

acc acc cgc ctg tac ctg gtg gac aac aag tca gca gat atc gcc acc 13681 
Thr Thr Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Thr 
65 70 75 

ttg aac tac cag aac gac cac agc aac ttt ctc acg tcc gtg gtg cag 13729 
Leu Asn Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gln 
80 85 90 

aac agc gac tac acg ccc gcc gaa gcg agc acg cag acc att aac ttg 13777 
Asn Ser Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gln Thr Ile Asn Leu 
95 100 105 110 

gac gac cgc tcg cgc tgg ggc ggg gac ttg aaa acc att ctg cac act 13825 
Asp Asp Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu His Thr 
115 120 125 

aac atg ccc aac gtg aac gag ttc atg ttt acc aac tcg ttc agg gct 13873 
Asn Met Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala 
130 135 140 

aaa ctt atg gtg gcg cac gag gcc gac aag gac ccg gtt tat gag tgg 13921 
Lys Leu Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp 
145 150 155 

gtg cag ctg acg ctg ccg gag ggg aac ttt tca gag att atg acc ata 13969 
Val Gln Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu Ile Met Thr Ile 
160 165 170 

gac ctg atg aac aac gcc att atc gac cac tac ctg gcg gta gcc aga 14017 
Asp Leu Met Asn Asn Ala Ile Ile Asp His Tyr Leu Ala Val Ala Arg 
175 180 185 190 

cag cag ggg gtg aaa gaa agc gag atc ggc gtc aag ttt gac acg cgc 14065 
Gln Gln Gly Val Lys Glu Ser Glu Ile Gly Val Lys Phe Asp Thr Arg 
195 200 205 

aac ttt cgt ctg ggc tgg gac ccg gag acg ggg ctt gtg atg ccg ggg 14113 
Asn Phe Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly 
210 215 220 

gtg tac acg aac gaa gct ttc cat ccc gac gtg gtc ctc ttg ccg ggc 14161 
Val Tyr Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly 
225 230 235 

tgc ggg gtg gac ttt acc tac agc cgg tta aac aac ctg cta ggc ata 14209 
Cys Gly Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly Ile 
240 245 250 

cgc aag aga atg ccc ttt cag gaa ggg ttt cag atc ctg tac gag gac 14257 
Arg Lys Arg Met Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp 
255 260 265 270 

ctg gag ggc ggt aac atc ccg gcc ctg ctg gac gtg ccg gcg tac gag 14305 
Leu Glu Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu 
275 280 285 

gag agc atc gcc aac gca agg gag gcg gcg atc agg ggc gat aat ttc 14353 
Glu Ser Ile Ala Asn Ala Arg Glu Ala Ala Ile Arg Gly Asp Asn Phe 
290 295 300 

gcg gcg cag ccc cag gcg gct cca acc ata aaa ccc gtt ttg gaa gac 14401 
Ala Ala Gln Pro Gln Ala Ala Pro Thr Ile Lys Pro Val Leu Glu Asp 
305 310 315 

tcc aaa ggg cgg agc tac aac gta ata gcc aac acc aac aac acg gct 14449 
Ser Lys Gly Arg Ser Tyr Asn Val Ile Ala Asn Thr Asn Asn Thr Ala 
320 325 330 

tac agg agc tgg tat ctg gct tat aac tac ggc gac ccg gag aag ggg 14497 
Tyr Arg Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly 
335 340 345 350 

gtt agg gcc tgg acc ctg ctc acc act ccg gac gtg acg tgc ggt tca 14545 
Val Arg Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser 
355 360 365 

gag cag gtc tac tgg tcg ctg cct gac atg tac gtg gac cct gtg acg 14593 
Glu Gln Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr 
370 375 380 

ttt cgc tcc acg cag caa gtt agc aac tac cca gtg gtg gga gcg gag 14641 
Phe Arg Ser Thr Gln Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu 
385 390 395 

ctt atg ccg att cac agc aag agc ttt tac aac gag cag gcc gtc tac 14689 
Leu Met Pro Ile His Ser Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr 
400 405 410 

tca cag ctc att cgt cag acc acc gcc cta acg cac gtt ttc aac cgc 14737 
Ser Gln Leu Ile Arg Gln Thr Thr Ala Leu Thr His Val Phe Asn Arg 
415 420 425 430 

ttc ccc gag aac caa atc cta gtg cga cct cca gcg ccc acc atc acc 14785 
Phe Pro Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr 
435 440 445 

acc gtc agc gag aac gtg ccc gct cta acc gat cac ggg acg ctg cct 14833 
Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro 
450 455 460 

ttg cag aac agc atc cgc gga gtt cag cga gtt acc atc acg gac gcc 14881 
Leu Gln Asn Ser Ile Arg Gly Val Gln Arg Val Thr Ile Thr Asp Ala 
465 470 475 

cgt cgt cgg acc tgt ccc tac gtc tac aaa gcc ttg gga atc gtg gcc 14929 
Arg Arg Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala 
480 485 490 

ccg cgc gtc ctg tcg agt cgc act ttc tag atgtccatcc tcatctctcc 14979 
Pro Arg Val Leu Ser Ser Arg Thr Phe 
495 500 

cagcaacaat accggttggg gtctgggcgt gaccaaaatg tacggaggcg ccaaacgacg 15039 
gtccccacaa catcccgtgc gagtgcgcgg gcactttaga gccccatggg ggtcgcacac 15099 
gcgcgggcgc accggccgaa ccaccgtcga cgacgtgatc gatagcgtgg tggccgacgc 15159 
ccgcaactac cagcccgctc gatccacggt ggacgaagtc atcgacggcg tggtggccga 15219 


cgccagggcc tacgcccgca gaaagtctcg tctgcgccgc cgccgttcgc taaagcgccc 15279 
cacggccgcc atgaaagccg ctcgctctct gctgcgtcgc gcacgtatcg tgggtcgccg 15339 
cgccgccaga cgcgcagccg ccaacgccgc cgccggccga gtgcgccgcc gggccgccca 15399 
gcaggccgcc gccgccatct ccagtctatc cgccccccga cgcgggaatg tgtactgggt 15459 
cagggactcg gccaccggcg tgcgagttcc cgtgagaacc cgtcctcctc gtccctgaat 15519 
aaaaagttct aagcccaatc ggtgttccgt tgtgtgttca gctcgtcatg accaaacgca 15579 
agtttaaaga ggagctgctg caagcgctgg tccccgaaat ctatgcgccg gcgccggacg 15639 
tgaaaccgcg tcgcgtgaaa cgcgtgaaga agcaggaaaa gctagagaca aaagaggagg 15699 
cggtggcgtt gggagacggg gaggtggagt ttgtgcgctc gttcgcgccg cgtcggcgag 15759 
tgaattggaa ggggcgcaag gtgcaacggg tgctgcgtcc cggcacggtg gtgtctttca 15819 
ccccgggtga aaaatccgcc tggaagggca taaagcgcgt gtacgatgag gtgtacgggg 15879 
acgaagacat tctggagcag gcgctggata gaagcgggga gtttgcttac ggcaagaggg 15939 
cgaggacggg cgagatcgcc atcccgctgg acacttccaa ccccaccccc agtctgaaac 15999 
ccgtgacgct gcaacaggtg ttgccggtga gcgccccctc gcgacgcggc ataaaacgcg 16059 
agggcggcga gctgcagccc accatgcagc tcctggttcc caagaggcag aaactagagg 16119 
acgtactgga catgataaaa atggagcccg acgtgcagcc cgatattaaa atccgtccca 16179 
tcaaagaagt ggcgccggga atgggcgtgc agaccgtgga catccagatt cccatgacca 16239 
gcgccgcaca ggcggtagag gccatgcaga ccgacgtggg gatgatgacg gacctgcccg 16299 
cagctgctgc cgccgtggcc agcgccgcga cgcaaacgga agccggcatg cagaccgacc 16359 
cgtggacgga ggcgcccgtg cagccggcca gaagacgcgt cagacggacg tacggccccg 16419 
tttctggcat aatgccggag tacgcgctgc atccttccat catccccacc cccggctacc 16479 
gggggcgcac ctaccgtccg cgacgcagca ccactcgccg ccgtcgccgc acggcacgag 16539 
tcgccaccgc cagagtgaga cgcgtaacga cacgtcgcgg ccgccgcttg accctgcccg 16599 
tggtgcgcta ccatcccagc attctttaaa aaaccgctcc tacgttgcag atgggcaagc 16659 
ttacttgtcg actccgtatg gccgtgcccg gctaccgagg aagatcccgc cgacgacgga 16719 
ctttgggagg cagcggtttg cgccgccgtc gggcggttca ccggcgcctc aagggaggca 16779 
ttctgccggc cctgatcccc ataatcgccg cagccatcgg ggccattccc ggaatcgcca 16839 






t~T /—I t-t 4~ ^3 /~X <">T /T+" 

g Cy Cdy cy y c 


rtr*2*rrrrr*\~z* pit* 
y Lay y n-ayu 


rarfrrfccact 

1 — d. y i — . y ^ v-« v— 


ga 1. 1 ttac ta 


accctgtcgg 


tcgcgccgtc 


16899 
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agaca tea a t 


ttctcctctc 


tqcfccccgcg 


16959 


yCdcggcdcg 


c-yyc-c-y lolo 


■h rrrr rr rra r*n"f" rr 


gagegagate 


ggcacga a cc 


agatgaaegg 


17019 




aaLLyyay L.y 


rrtcrtat cfCfacr 
y Ly LyLyyay 


egge t tgaaa 


aatttcggtt 


ccactctgaa 


17079 


ad.OLLaL.yyu 


aaL.L>y y y i— y i— 


rrcra a ctccacf 

y y «. ci <— » <— u. 


cacaacTCf caa 


atcrctaaqqcf 


acaagctaaa 


17139 


rrrra r^^i p* pt p* a pt 
y y aL-auy L.ay 


tt t <~*a ocaaa 


aggtggtgga 


eggea teget 


tcgggcctca 


acggcgccgt 


17199 
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fcfc caaaacrcra 


aa t taacagc 


cgcctggagc 


cgcggccgca 


17259 
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gc tgcccccc 


aagggagaga 


aqccfcccgcg 
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17319 
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"t"arft"t"artaa 


gagegaggag 


ccgcca teat 


acgaggaggc 


17379 
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accacct at c 


ccatgaccaa 


17439 
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cccat cgacg 


cggtggccac 


17499 
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ccgcg tgee t 


cccgtcccca 


tcgctccgcc 


17559 


ggtttctcgc cccgccatcc gccccgtcgc cgtggccact ccccgctatc cgagccgcaa 17619 
cgccaactgg cagaccaccc tcaacagtat tgtcggactg ggggtgaagt ctctgaagcg 17679 
ccgtcgctgt ttttaaagca caatttatta aacgagtagc cctgtcttaa tccatcgttg 17739 

tatgtgtgcc tatatcacgc gttcagagcc tgaccgtccg tcaag atg gcc act ccg 17796 
Met Ala Thr Pro 
505 

tcg atg atg ccg cag tgg tcg tac atg cac atc gcc ggg cag gac gcc 17844 
Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala Gly Gln Asp Ala 
510 515 520 

tcg gag tac ctg agc ccg ggt ctg gtg cag ttt gcc cgt gcg acg gaa 17892 
Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala Arg Ala Thr Glu 
525 530 535 

acc tac ttc tca ctg ggc aac aag ttc agg aac ccc acc gtg gcg ccc 17940 
Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr Val Ala Pro 
540 545 550 555 

acc cac gac gtc acc acc gat cgg tcc cag cga ctg aca atc cgc ttc 17988 
Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu Thr Ile Arg Phe 
560 565 570 

gtc ccc gtg gac aag gaa gac acc gct tac tcc tac aaa acc cgc ttc 18036 
Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr Lys Thr Arg Phe 
575 580 585 

acg ctg gcc gtg ggc gac aac cgg gtg cta gac atg gcc agt acc tac 18084 
Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr 
590 595 600 

ttt gac atc cgc ggc gtg atc gac cgc gga cct agc ttc aag cct tac 18132 
Phe Asp Ile Arg Gly Val Ile Asp Arg Gly Pro Ser Phe Lys Pro Tyr 
605 610 615 

tcc ggc acg gct tac aac tca ctg gct ccc aaa ggg gcg ccc aac aac 18180 
Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly Ala Pro Asn Asn 
620 625 630 635 

agc caa tgg aac gcc aca gat aac ggg aac aag cca gtg tgt ttt gct 18228 
Ser Gln Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro Val Cys Phe Ala 
640 645 650 

cag gca gct ttt ata ggt caa agc att aca aaa gac gga gtg caa ata 18276 
Gln Ala Ala Phe Ile Gly Gln Ser Ile Thr Lys Asp Gly Val Gln Ile 
655 660 665 

cag aac tca gaa aat caa cag gct gct gcc gac aaa act tac caa cca 18324 
Gln Asn Ser Glu Asn Gln Gln Ala Ala Ala Asp Lys Thr Tyr Gln Pro 
670 675 680 

gag cct caa att gga gtt tcc acc tgg gat acc aac gtt acc agt aac 18372 
Glu Pro Gln Ile Gly Val Ser Thr Trp Asp Thr Asn Val Thr Ser Asn 
685 690 695 

gct gcc gga cga gtg tta aaa gcc acc act ccc atg ctg cca tgt tac 18420 
Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met Leu Pro Cys Tyr 
700 705 710 715 

ggt tca tat gcc aat ccc act aat cca aac ggg ggt cag gca aaa aca 18468 
Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly Gln Ala Lys Thr 
720 725 730 

gaa gga gac att tcg cta aac ttt ttc aca aca act gcg gca gca gac 18516 
Glu Gly Asp Ile Ser Leu Asn Phe Phe Thr Thr Thr Ala Ala Ala Asp 
735 740 745 

aat aat ccc aaa gtg gtt ctt tac agc gaa gat gta aac ctt caa gcc 18564 
Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val Asn Leu Gln Ala 
750 755 760 

ccc gat act cac tta gta tat aag cca acg gtg gga gaa aac gtt atc 18612 
Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly Glu Asn Val Ile 
765 770 775 

gcc gca gaa gcc ctg cta acg cag cag gcg tgt ccc aac aga gca aac 18660 
Ala Ala Glu Ala Leu Leu Thr Gln Gln Ala Cys Pro Asn Arg Ala Asn 
780 785 790 795 

tac ata ggt ttc cga gat aac ttt atc ggt tta atg tat tat aac agc 18708 
Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met Tyr Tyr Asn Ser 
800 805 810 

aca ggg aac atg gga gtt ctg gca ggt cag gcc tcg cag tta aac gca 18756 
Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser Gln Leu Asn Ala 
815 820 825 

gtt gta gac ctg caa gat cga aac acg gaa ctg tcc tat cag cta atg 18804 
Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser Tyr Gln Leu Met 
830 835 840 

cta gat gct ctg ggt gac aga act cga tat ttc tca atg tgg aat cag 18852 
Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser Met Trp Asn Gln 
845 850 855 

gcc gtg gac agc tac gat cca gac gtt agg att atc gag aac cat ggg 18900 
Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile Glu Asn His Gly 
860 865 870 875 

gtg gaa gac gag ctg ccc aat tac tgt ttt cca ctc cca ggc atg ggt 18948 
Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Pro Gly Met Gly 
880 885 890 

att ttt aac tcc tac aag ggg gta aaa cca caa aat ggc ggt aat ggt 18996 
Ile Phe Asn Ser Tyr Lys Gly Val Lys Pro Gln Asn Gly Gly Asn Gly 
895 900 905 

aac tgg gaa gca aac ggg gac cta tca aat gcc aat gag atc gct tta 19044 
Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn Glu Ile Ala Leu 
910 915 920 

gga aac att ttt gcc atg gaa att aac ctc cac gca aac ctg tgg cgc 19092 
Gly Asn Ile Phe Ala Met Glu Ile Asn Leu His Ala Asn Leu Trp Arg 
925 930 935 

agc ttc ttg tac agc aat gtg gcg ctg tac ctg cca gac agc tat aaa 19140 
Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Ser Tyr Lys 
940 945 950 955 

ttc act ccc gct aac atc act ctg ccc gcc aac caa aac acc tac gag 19188 
Phe Thr Pro Ala Asn Ile Thr Leu Pro Ala Asn Gln Asn Thr Tyr Glu 
960 965 970 

tat atc aac ggg cgc gtc act tct cca acc ctg gtg gac acc ttt gtt 19236 
Tyr Ile Asn Gly Arg Val Thr Ser Pro Thr Leu Val Asp Thr Phe Val 
975 980 985 

aac att gga gcc cga tgg tcg ccg gat ccc atg gac aac gtc aac ccc 19284 
Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro 
990 995 1000 

ttt aac cat cac cgg aac gcg ggc ctc cgt tac cgc tcc atg ctg 19329 
Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu 
1005 1010 1015 

ctg gga aat gga cgc gtg gtg cct ttc cac ata caa gtg ccg caa 19374 
Leu Gly Asn Gly Arg Val Val Pro Phe His Ile Gln Val Pro Gln 
1020 1025 1030 

aaa ttt ttc gcg att aag aac ctc ctg ctt ttg ccc ggc tcc tac 19419 
Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr 
1035 1040 1045 

act tac gag tgg agc ttc aga aaa gac gtg aac atg att ctg cag 19464 
Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met Ile Leu Gln 
1050 1055 1060 

agc acc ctg ggc aat gat ctt cga gtg gac ggg gcc agc gtc cgc 19509 
Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg 
1065 1070 1075 

att gac agc gtc aac ttg tac gcc aac ttt ttc ccc atg gcg cac 19554 
Ile Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His 
1080 1085 1090 

aac acc gct tct acc ttg gaa gcc atg ctg cga aac gac acc aac 19599 
Asn Thr Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn 
1095 1100 1105 

gac cag tcg ttt aac gac tac ctc agc gcg gcc aac atg ctt tat 19644 
Asp Gln Ser Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr 
1110 1115 1120 

ccc att ccg gcc aac gcc acc aac gtt ccc att tcc att ccc tcc 19689 
Pro Ile Pro Ala Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser 
1125 1130 1135 

cgc aac tgg gcg gcc ttc cgg gga tgg agc ttc acc cgc ctt aaa 19734 
Arg Asn Trp Ala Ala Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys 
1140 1145 1150 

gcc aag gaa acg cct tcc ttg ggc tcc ggc ttt gac ccc tac ttt 19779 
Ala Lys Glu Thr Pro Ser Leu Gly Ser Gly Phe Asp Pro Tyr Phe 
1155 1160 1165 

gtg tac tca ggc acc att cct tac ctg gac ggc agc ttt tac ctc 19824 
Val Tyr Ser Gly Thr Ile Pro Tyr Leu Asp Gly Ser Phe Tyr Leu 
1170 1175 1180 

aac cac act ttc aaa cgt ctg tcc atc atg ttc gat tct tcc gta 19869 
Asn His Thr Phe Lys Arg Leu Ser Ile Met Phe Asp Ser Ser Val 
1185 1190 1195 

agt tgg ccg ggc aac gac cgc ctc ctg acg ccg aac gag ttc gaa 19914 
Ser Trp Pro Gly Asn Asp Arg Leu Leu Thr Pro Asn Glu Phe Glu 
1200 1205 1210 

att aag cgc att gtg gac ggg gaa ggc tac aac gtg gct caa agt 19959 
Ile Lys Arg Ile Val Asp Gly Glu Gly Tyr Asn Val Ala Gln Ser 
1215 1220 1225 

aac atg acc aaa gac tgg ttt tta att caa atg ctc agc cac tac 20004 
Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met Leu Ser His Tyr 
1230 1235 1240 

aac atc ggc tac caa ggc ttc tat gtt ccc gag ggc tac aag gat 20049 
Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly Tyr Lys Asp 
1245 1250 1255 

cgg atg tat tct ttc ttc cga aac ttt cag ccc atg agc cgc cag 20094 
Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser Arg Gln 
1260 1265 1270 

gtg ccg gat ccc acc gct gcc ggc tat caa gcc gtt ccc ctg ccc 20139 
Val Pro Asp Pro Thr Ala Ala Gly Tyr Gln Ala Val Pro Leu Pro 
1275 1280 1285 

aga caa cac aac aac tcg ggc ttt gtg ggg tac atg ggc ccg acc 20184 
Arg Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
1290 1295 1300 

atg cgc gaa gga cag cca tac ccg gcc aac tac ccc tat ccc ctg 20229 
Met Arg Glu Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 
1305 1310 1315 

atc ggc gct acc gcc gtc ccc gcc att acc cag aaa aag ttt ttg 20274 
Ile Gly Ala Thr Ala Val Pro Ala Ile Thr Gln Lys Lys Phe Leu 
1320 1325 1330 

tgc gac cgc gtc atg tgg cgc ata cct ttt tcc agc aac ttt atg 20319 
Cys Asp Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met 
1335 1340 1345 

tca atg ggg gcc ctg acc gac ctc gga cag aac atg ctt tac gct 20364 
Ser Met Gly Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala 
1350 1355 1360 

aac tcc gcc cat gcc ctg gat atg act ttt gag gtg gac ccc atg 20409 
Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met 
1365 1370 1375 

aac gag ccc acg ttg ctg tac atg ctt ttt gag gtg ttc gac gtg 20454 
Asn Glu Pro Thr Leu Leu Tyr Met Leu Phe Glu Val Phe Asp Val 
1380 1385 1390 

gtc aga gtg cac cag ccg cac cgc ggt att atc gag gcc gtg tac 20499 
Val Arg Val His Gln Pro His Arg Gly Ile Ile Glu Ala Val Tyr 
1395 1400 1405 

ctg cgc acc ccc ttc tct gcg ggc aat gcc acc aca taa gccgctgaac 20548 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 
1410 1415 1420 


tagctggttt ttaccccaga tcccatgggc tccacggaag acgaactgcg ggccattgtg 20608 
cgagacctgg gctgcggacc ctacttcctg ggcacctttg acaagcggtt tcccgggttc 20668 
gtgtctcctc gcaaactcgc gtgcgcgatc gtgaataccg ccggccgaga gaccggagga 20728 
gagcattggc tagctctggg ctggaacccc cgctcgtcca cgtttttcct gttcgacccc 20788 
tttggctttt cagaccaacg cttgaagcag atctatgcat ttgaatatga gggtctactc 20848 
aagcgaagcg cgctggcctc ctccgccgat cactgtctaa ccctggtaaa gagcactcag 20908 


acggttcagg gccctcacag cgccgcctgt ggcctttttt gttgcatgtt tttgcacgcc 20968 
tttgtgaact ggccggacac ccccatggaa aacaacccca ccatggacct cctgactggc 21028 
gttcccaact ccatgctcca aagccccagc gtgcagacca ccctcctcca aaaccagaaa 21088 
aatctgtacg cctttctgca caagcactct ccctactttc gccgccatcg ggaacaaata 21148 
gaaaatgcaa ccgcgtttaa caaaactctg taacgtttaa taaatgaact ttttattgaa 21208 
ctggaaaacg ggtttgtgat ttttaaaaat caaaggggtt gagctggaca tccatgtggg 21268 
aggccggaag ggtggtgttc ttgtactggt acttgggcag ccacttaaac tctggaatca 21328 
caaacttggg cagcggtatt tctgggaagt tgtcgtgcca cagctggcgg gtcagctgaa 21388 
gtgcctgcag aacatcgggg gcggagatct tgaagtcgca gtttatctgg ttcacggcac 21448 
gcgcgttgcg gtacatggga ttggcacact gaaacaccag caggctggga ttcttgatgc 21508 
tagccagggc cacggcgtcg gtcacgtcac cggtgtcttc tatgttggac agcgaaaaag 21568 
gcgtgacttt gcaaagctgg cgtcccgcgc gaggcacgca atctcccagg tagttgcact 21628 
cacagcggat gggcagaaga agatgcttgt ggccgcgggt catgtaggga taggccgctg 21688 
ccataaaagc ttcgatctgc ctgaaagcct gcttggcctt gtgcccttcg gtataaaaaa 21748 
caccgcagga cttgttggaa aaggtattac tggcgcaagc ggcatcgtga aagcaagcgc 21808 
gtgcgtcttc gtttcgtaac tgcaccacgc tgcggcccca ccggttctga atcaccttgg 21868 
ccctgccggg gttttccttg agagcgcgct ggccggcttc gctgcccaca tccatttcca 21928 
cgacatgctc cttgttaatc atggccagac cgtggaggca gcgcagctcc tcgtcatcgt 21988 
cggtgcagtg atgctcccac acgacgcagc cagtgggctc ccacttgggc ttggaggcct 22048 
cggcaatgcc agaatacagg agaacgtagt ggtgcagaaa acgtcccatc atggtgccaa 22108 
aggttttctg gctgctgaag gtcatcgggc agtacctcca gtcctcgtta agccaagtgt 22168 
tgcagatctt cctgaagacc gtgtactgat cgggcataaa gtggaactca ttgcgctcgg 22228 
tcttgtcgat cttatacttt tccatcagac tatgcataat ctccatgccc ttttcccagg 22288 
cgcaaacaat cttggtgcta cacgggttag gtatggccaa agtggttggc ctctgaggcg 22348 
gcgcttgttc ttcctcttga gccctctccc gactgacggg ggttgaaaga gggtgcccct 22408 
tggggaacgg cttgaacacg gtctggcccg aggcgtcccg aagaatctgc atcgggggat 22468 
tgctggccgt catggcgatg atctgacccc ggggctcctc cacttcgtcc tcctcgggac 22528 
tttcctcgtg cttttcgggg gacggtacgg gagtaggggg aagagcgcgg cgcgccttct 22588 


tcttgggcgg cagttccgga gcctgctctt gacgactggc cattgtcttc tcctaggcaa 22648 
gaaaaacaag atggaagact ctttctcctc ctcctcgtca acgtcagaaa gcgagtcttc 22708 
caccttaagc gccgagaact cccagcgcat agaatccgat gtgggctacg agactccccc 22768 
cgcgaacttt tcgccgcccc ccataaacac taacgggtgg acggactacc tggccctagg 22828 
agacgtactg ctgaagcaca tcaggcggca gagcgttatc gtgcaagatg ctctcaccga 22888 
gcgactcgcg gttccgctgg aagtggcgga acttagcgcc gcctacgagc gaaccctctt 22948 
ctccccaaag actcccccca agaggcaggc taacggcacc tgcgagccta accctcgact 23008 
caacttctac cctgcctttg ccgtgccaga ggtactggct acgtaccaca tttttttcca 23068 
aaaccacaaa atccctctct cgtgccgcgc caaccgcacc aaagccgatc gcgtgctgcg 23128 
actggaggaa ggggctcgca tacctgagat tgcgtgtctg gaggaagtcc caaaaatctt 23188 
tgaaggtctg ggccgcgacg aaaagcgagc agcaaacgct ctggaagaga acgcagagag 23248 
tcacaacagc gccttggtag aactcgaggg cgacaacgcc agactggccg tcctcaaacg 23308 
gtccatagaa gtcacgcact tcgcctaccc cgccgttaac ctccctccaa aagttatgac 23368 
agcggtcatg gactcgctgc tcataaagcg cgctcagccc ttagacccag agcacgaaaa 23428 
caacagtgac gaaggaaaac cggtggtttc tgatgaggag ttgagcaagt ggctgtcctc 23488 
caacgacccc gccacgttgg aggaacgaag aaaaaccatg atggccgtgg tgctagttac 23548 
cgtgcaatta gaatgtctgc agaggttctt ttcccaccca gagaccctga gaaaagtgga 23608 
ggaaacgctg cactacacat ttaggcacgg ctacgtgaag caagcctgca agatttccaa 23668 
cgtagaactt agcaacctca tctcctacct ggggatcttg cacgaaaacc gcctcggaca 23728 
aaacgtgctg cacagcacac tgaaaggaga agcccgccga gactatgtgc gagactgcgt 23788 
gttcctagcg ctagtgtaca cctggcagag cggaatggga gtctggcagc agtgcctgga 23848 
ggacgaaaac ctcaaagagc ttgaaaagct gctggtgcgc tccagaaggg cactgtggac 23908 
cagttttgac gagcgcaccg ccgcgcgaga cctagctgat attatttttc ctcccaagct 23968 
ggtgcagact ctccgggaag gactgccaga ttttatgagt caaagcatct tgcaaaactt 24028 
ccgctctttc atcttggaac gctcgggaat cttgcccgcc actagctgcg ccctacccac 24088 
agattttgtg cbtctccact accgcgaatg cccaccgccg ctgtggccgt acacttactt 24148 
gcttaaactg gccaactttc taatgttcca ctctgacctg gcagaagacg ttagcggcga 24208 
ggggctgcta gaatgccact gccgctgcaa cctgtgcacc ccccaccgct ctctagtatg 24268 
caacactccc ctgctcaatg agacccagat catcggtacc tttgaaatcc agggaccctc 24328 
cgacgcggaa aacggcaagc aggggtctgg gctaaaactc acagccggac tgtggacctc 24388 
cgcctacttg cgcaaatttg taccagaaga ctatcacgcc caccaaatta aattttacga 24448 
aaaccaatca aaaccaccca aaagcgagtt aacggcttgc gtcattacgc agagcagcat 24508 
agttgggcag ttgcaagcca ttaacaaagc gcggcaagag tttctcctaa aaaaaggaaa 24568 
aggggtctac ttggaccccc agaccggcga ggaactcaac ggaccctcct cagtcgcagg 24628 
ttgtgtgccc catgccgccc aaaaagaaca cctcgcagtg gaacatgcca gagacggagg 24688 
aagaggagtg gagcagtgtg agcaacagcg aaacggagga agagccgtgg cccgaggggt 24748 
gcaacgggga agaggacacg gagggacggc gaagtcttcg ccgaagaact ctcgccgctg 24808 
cccccgaagt cccagccggc cgcctcggcc caagatcccg cacacacccg tagatgggat 24868 
agcaagacca aaaagccggg taagagaaac gctcgccccc gccagggcta ccgctcgtgg 24928 
agaaagcaca aaaactgcat cttatcgtgc ttgctccagt gcggcggaga cgtttcgttc 24988 
acccgtagat acttgctttt taacaaaggg gtggccgtcc cccgtaacgt cctccactac 25048 
taccgtcact cttacagctc cgaagcggac ggctaagaaa acgcagcagt tgccggcggg 25108 
aggactgcgt ctcagcgccc gagaaccccc agccaccagg gagctccgaa accgcatatt 25168 
tcccaccctc tacgctatct ttcagcaaag ccgggggcag cagcaagaac tgaaaataaa 25228 
aaaccgcacg ctgaggtcgc ttacccgaag ctgcctctat cacaagagcg aagagcagct 25288 
gcagcgaacc ctggaggacg cagaagcgct gttccagaag tactgcgcga ccaccctaaa 25348 
taactaaaaa agcccgcgcg cgggacttca aaccgtctga cgtcaccagc cgcgcgccaa 25408 
aatgagcaaa gagattccca cgccttacat gtggagttac cagccgcaga tgggattagc 25468 
cgccggcgcc gcccaggatt actccacgaa aatgaactgg ctcagcgccg ggccccacat 25528 
gatttcccgc gtaaacgaca ttcgcgccca ccgcaatcag ctattgttag aacaggctgc 25588 
tctgaccgcc acgccccgta ataacctgaa ccctcccagc tggccagctg ccctggtgta 25648 
ccaggaaacg cctccaccca ccagcgtact tttgccccgt gacgcccagg cggaagtcca 25708 
gatgactaac gcgggcgcgc aattagcggg cggatcccgg tttcggtaca gagttcacgg 25768 
cgccgcaccc tatagcccag gtataaagag gctgatcatt cgaggcagag gtgtccagct 25828 
caacgacgag acagtgagct cttcgcttgg tctacgacca gacggagtgt tccagctcgc 25888 
gggctcgggc cgctcttcgt tcacgcctcg ccaggcatac ctgactctgc agagctctgc 25948 
ctctcagcct cgctcgggag gaatcggacc ccttcagttt gtggaggagt ttgtgccctc 26008 
ggtctacttt cagcctttct ccggatcgcc cggccagtac ccggacgagt tcatccccaa 26068 
cttcgacgcg gtgagtgact ctgtggacgg ttatgactga tgtcgagccc gcttcagtgc 26128 
tagtggaaca agcgcggctc aatcacctgg ttcgttgccg ccgccgctgc tgcgtggctc 26188 
gcgacttgag cttagctctc aagtttgtaa aaaacccgtc cgaaaccggg agcgctgtgc 26248 
acgggttgga gctagtgggt cctgagaagg ccaccatcca cgttctcaga aactttgtgg 26308 
aaaaacccat tttggttaaa cgagatcagg ggccttttgt aatcagctta ctctgcacct 26368 
gtaaccatgt tgaccttcac gactatttta tggatcattt gtgcgctgaa ttcaataagt 26428 
aaagcgaatt cttaccaaga ttatgatgtc catgactgtt cctcgccact atacgatgtt 26488 
gtgccagtaa actctcttgt cgacatctat ctgaactgtt ccttttggtc cgcacagctt 26548 
acttggtact acggtgacac cgtcctttct ggctcactgg gcagctcaca cggaataaca 26608 
cttcacctct tttcgccgtt tcgatacgga aactacagct gtcgtgccgg tacctgcctc 26668 
cacgttttca atcttcagcc ctgtccaccg accaaacttg tatttgtcga ctctaagcac 26728 
ttacagctca actgcagcat tctaggcccc agtatcttgt ggacatacaa taaaatcagg 26788 
ttggtggaat ttgtctacta cccacccagc gcccgcggtt ttggggaaat tcctttccag 26848 
atctactaca actatcttgc cacacattat gcaagtcaac agcaactaaa cttgcaagca 26908 
cccttcacgc caggagagta ctcctgtcac gtaggctcct gcacagaaac ttttattctc 26968 
ttcaacagat cttctgccat tgaacgcttc actactaact actttagaaa ccaagttgtg 27028 
cttttcactg acgaaacccc taacgtcacc ctggactgtg catgtttttc tcatgacacc 27088 
gtaacttgga ctcttaacaa tactctctgg ctcgcgttcg ataaccaaag cttgattgtt 27148 
aaaaattttg atttaacctt tactaaaccc tctcctcgcg aaatagttat ctttgctcct 27208 
tttaatccaa aaactacctt agcctgtcag gttttgttta agccttgcca aacaaacttt 27268 
aagtttgttt atttgcctcc gcaatctgtc aaactcatag aaaaatacaa caaagcgccc 27328 
gtcttggctc ctaaaacctt ctaccactgg ctaacctaca cggggctgtt tgcactaatt 27388 
gtttttttcc taattaacat ttttatatgt ttcttgcctt cctccttctt ttcgcgaaca 27448 
ccgttgccgc agaaagacct ctccttatta ctgtagcgct tgctatacaa aaccaagagt 27508 
ggtcaaccgt gctctcaatc tattttcaat ttttcatttt gtccttaata ctttctctta 27568 
ttgtcgttaa caatgatctg gagcattggt ctcgcctttt tttggctgct tagtgcaaaa 27628 
gccactattt ttcacaggta tgtggaagaa ggaactagca ccctctttac gatacctgaa 27688 
acaattaagg cggctgatga agtttcttgg tacaaaggct cgctctcaga cggcaaccac 27748 
tcattctcag gacagaccct ttgcatccaa gaaacttatt ttaaatcaga actacaatac 27808 
agctgcataa aaaacttttt ccatctctac aacatctcaa aaccctatga gggtatttac 27868 
aatgccaagg tttcagacaa ctccagcaca cggaactttt actttaatct gacagttatt 27928 
aaagcaattt ccattcctat ctgtgagttt agctcccagt ttctttctga aacctactgt 27988 
ttaattacta taaactgcac taaaaatcgc cttcacacca ccataatcta caatcacaca 28048 
caatcacctt gggttttaaa cctaaaattt tctccacaca tgccttcgca atttctcacg 28108 
caagttaccg tctctaacat aagcaagcag tttggctttt actatccttt ccacgaactg 28168 
tgcgaaataa ttgaagccga atatgaacca gactacttta cttacattgc cattggtgta 28228 
atcgttgttt gcctttgctt tgttattggg gggtgtgttt atttgtacat tcagagaaaa 28288 
atattgctct cgctgtgctc ctgcggttac aaagcagaag aaagaattaa aatctctaca 28348 
ctttattaat gttttccaga aatggcaaaa ctaacgctcc tacttttgct tctcacgccg 28408 
gtgacgcttt ttaccatcac tttttctgcc gccgccacac tcgaacctca atgtttgcca 28468 
ccggttgaag tctactttgt ctacgtgttg ctgtgctgcg ttagcgtttg cagtataaca 28528 
tgttttacct ttgtttttct tcagtgcatt gactacttct gggtcagact ctactaccgc 28588 
agacacgcgc ctcagtatca aaatcaacaa attgccagac tactcggtct gccatgattg 28648 
tcttgtattt taccctgatt ttttttcacc ttacttgcgc ttgtgatttt cacttcactc 28708 
aattttggaa aacgcaatgc ttcgacccgc gcctctccaa cgactggatg atggctcttg 28768 
caattgccac gcttggggcg tttggacttt ttagtggttt tgctttgcat tacaaattta 28828 
agactccatg gacacatggc tttctttcag attttccagt tacacctact ccgccgcctc 28888 
ccccggccat cgacgtgcct caggttccct caccttctcc atctgtctgc agctactttc 28948 
atctgtaatg gccgacctag aatttgacgg agtgcaatct gagcaaaggg ctatacactt 29008 
ccaacgccag tcggaccgcg aacgcaaaaa cagagagctg caaaccatac aaaacaccca 29068 
ccaatgtaaa cgcgggatat tttgtattgt aaaacaagct aagctccact acgagcttct 29128 
atctggcaac gaccacgagc tccaatacgt ggtcgatcag cagcgtcaaa cctgtgtatt 29188 
cttaattgga gtttccccca ttaaagttac tcaaaccaag ggtgaaacca agggaaccat 29248 
aaggtgctca tgtcacctgt cagaatgcct ttacactcta gttaaaaccc tatgtggctt 29308 
acatgattct atccccttta attaaataaa cttactttaa atctgcaatc acttcttcgt 29368 
ccttgttttt gtcgccatcc agcagcacca ccttcccctc ttcccaactt tcatagcata 29428 
ttttccgaaa agaggcgtac tttcgccaca ccttaaaggg aacgtttact tcgctttcaa 29488 

gctctcccac gattttcatt gcagat atg aaa cgc gcc aaa gtg gaa gaa gga 29541 
Met Lys Arg Ala Lys Val Glu Glu Gly 
1425 

ttt aac ccc gtt tat ccc tat gga tat tct act ccg act gac gtg 29586 
Phe Asn Pro Val Tyr Pro Tyr Gly Tyr Ser Thr Pro Thr Asp Val 
1430 1435 1440 

gct cct ccc ttt gta gcc tct gac ggt ctt caa gaa aac cca cct 29631 
Ala Pro Pro Phe Val Ala Ser Asp Gly Leu Gln Glu Asn Pro Pro 
1445 1450 1455 

ggg gtc ttg tcc cta aaa ata tcc aaa cct tta act ttt aat gcc 29676 
Gly Val Leu Ser Leu Lys Ile Ser Lys Pro Leu Thr Phe Asn Ala 
1460 1465 1470 

tcc aag gct cta agc ctg gct att ggt cca gga tta aaa att caa 29721 
Ser Lys Ala Leu Ser Leu Ala Ile Gly Pro Gly Leu Lys Ile Gln 
1475 1480 1485 

gat ggt aaa cta gtg ggg gag gga caa gca att ctt gca aac ctg 29766 
Asp Gly Lys Leu Val Gly Glu Gly Gln Ala Ile Leu Ala Asn Leu 
1490 1495 1500 

ccg ctt caa atc acc aac aac aca att tca cta cgt ttt ggg aac 29811 
Pro Leu Gln Ile Thr Asn Asn Thr Ile Ser Leu Arg Phe Gly Asn 
1505 1510 1515 

aca ctt gcc ttg aat gac aat aat gaa ctc caa acc aca cta aaa 29856 
Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gln Thr Thr Leu Lys 
1520 1525 1530 

tct tca tcg ccc ctt aaa atc aca gac cag act ctg tcc ctt aac 29901 
Ser Ser Ser Pro Leu Lys Ile Thr Asp Gln Thr Leu Ser Leu Asn 
1535 1540 1545 

ata ggg gac agc ctt gca att aaa gat gac aaa cta gaa agc gct 29946 
Ile Gly Asp Ser Leu Ala Ile Lys Asp Asp Lys Leu Glu Ser Ala 
1550 1555 1560 

ctt caa gcg acc ctc cca ctc tcc att agc aac aac acc atc agc 29991 
Leu Gln Ala Thr Leu Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser 
1565 1570 1575 

ctc aac gtg ggc acc gga ctc acc ata aat gga aac gtt tta caa 30036 
Leu Asn Val Gly Thr Gly Leu Thr Ile Asn Gly Asn Val Leu Gln 
1580 1585 1590 

gct gtt ccc tta aat gct cta agt ccc cta act att tcc aac aat 30081 
Ala Val Pro Leu Asn Ala Leu Ser Pro Leu Thr Ile Ser Asn Asn 
1595 1600 1605 

aac ate age ctg cgc tat ggc agt tec ctg acg gtg ctt aac aat 30126 

Asn He Ser Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn 

1610 1615 1620 

gaa ctg caa agc aac ctc aca gtt cac tcc cct tta aaa ctc aac 30171 
Glu Leu Gln Ser Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn 
1625 1630 1635 

tcc aac aac tca att tct ctc aac act cta tct ccg ttt aga atc 30216 
Ser Asn Asn Ser Ile Ser Leu Asn Thr Leu Ser Pro Phe Arg Ile 
1640 1645 1650 

gag aat ggt ttc ctc acg ctc tat ttg gga aca aaa tct ggc ttg 30261 
Glu Asn Gly Phe Leu Thr Leu Tyr Leu Gly Thr Lys Ser Gly Leu 
1655 1660 1665 

cta gtt caa aac agt ggc tta aaa gtt caa gcg ggc tac ggc ctg 30306 
Leu Val Gln Asn Ser Gly Leu Lys Val Gln Ala Gly Tyr Gly Leu 
1670 1675 1680 

caa gta aca gac acc aat gct ctc aca tta aga tat ctc gct cca 30351 
Gln Val Thr Asp Thr Asn Ala Leu Thr Leu Arg Tyr Leu Ala Pro 
1685 1690 1695 

ctg acc att cca gac tcg ggc tca gaa caa ggc att ctt aaa gta 30396 
Leu Thr Ile Pro Asp Ser Gly Ser Glu Gln Gly Ile Leu Lys Val 
1700 1705 1710 

aac act gga cag ggc cta agt gtg aac caa gct gga gcg ctt gaa 30441 
Asn Thr Gly Gln Gly Leu Ser Val Asn Gln Ala Gly Ala Leu Glu 
1715 1720 1725 

aca tcc cta gga ggt gga tta aaa tat gct gat aac aaa ata acc 30486 
Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp Asn Lys Ile Thr 
1730 1735 1740 

ttt gat aca gga aac gga ctg aca tta tct gaa aat aaa ctt gca 30531 
Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn Lys Leu Ala 
1745 1750 1755 

gta gct gca ggt agt ggt cta act ttt aga gat ggt gcc ttg gta 30576 
Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala Leu Val 
1760 1765 1770 

gcc acg gga acc gca ttt acg caa aca ctg tgg act acg gct gat 30621 
Ala Thr Gly Thr Ala Phe Thr Gln Thr Leu Trp Thr Thr Ala Asp 
1775 1780 1785 

ccg tct ccc aac tgc aca att ata cag gac cgc gac aca aaa ttt 30666 
Pro Ser Pro Asn Cys Thr Ile Ile Gln Asp Arg Asp Thr Lys Phe 
1790 1795 1800 

act ttg gcg ctt acc att agt ggg agc caa gtg ctg ggg acg gtt 30711 
Thr Leu Ala Leu Thr Ile Ser Gly Ser Gln Val Leu Gly Thr Val 
1805 1810 1815 

tcc att att gga gta aaa ggc ccc ctt tca agt agc ata ccg tca 30756 
Ser Ile Ile Gly Val Lys Gly Pro Leu Ser Ser Ser Ile Pro Ser 
1820 1825 1830 

gct acc gtt aca gta caa ctt aac ttt gat tcc aac gga gcc cta 30801 
Ala Thr Val Thr Val Gln Leu Asn Phe Asp Ser Asn Gly Ala Leu 
1835 1840 1845 

ttg agc tcc tct tca ctt aaa ggt tac tgg ggg tat cgc caa ggt 30846 
Leu Ser Ser Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gln Gly 
1850 1855 1860 

ccc tca att gac cct tac ccc ata att aat gcc tta aac ttt atg 30891 
Pro Ser Ile Asp Pro Tyr Pro Ile Ile Asn Ala Leu Asn Phe Met 
1865 1870 1875 

cca aac tca ctg gct tat ccc ccg gga caa gaa atc caa gca aaa 30936 
Pro Asn Ser Leu Ala Tyr Pro Pro Gly Gln Glu Ile Gln Ala Lys 
1880 1885 1890 

tgt aac atg tac gtt tct act ttt tta cga gga aat cca caa aga 30981 
Cys Asn Met Tyr Val Ser Thr Phe Leu Arg Gly Asn Pro Gln Arg 
1895 1900 1905 

cca ata gtt tta aac atc act ttt aat aat caa acc agc ggg ttt 31026 
Pro Ile Val Leu Asn Ile Thr Phe Asn Asn Gln Thr Ser Gly Phe 
1910 1915 1920 

tcc att aga ttt aca tgg aca aat tta acc aca gga gaa gca ttt 31071 
Ser Ile Arg Phe Thr Trp Thr Asn Leu Thr Thr Gly Glu Ala Phe 
1925 1930 1935 

gca atg ccc cca tgc act ttt tcc tac att gct gaa caa caa taa 31116 
Ala Met Pro Pro Cys Thr Phe Ser Tyr Ile Ala Glu Gln Gln 
1940 1945 1950 

actatgtaac cctcaccgtt aacccgcctc cgcccttcca ttttatttta taaaccaccc 31176 
gatccacctt ttcagcagta aacaattgca tgtcagtagg ggcagtaaaa cttttgggag 31236 
ttaaaatcca cacaggttct tcacaagcta agcgaaaatc agttacactt ataaaaccat 31296 
cgctaacatc ggacaaagac aagcatgagt ccaaagcttc cggttctgga tcagattttt 31356 
gttcattaac agcgggagaa acagcttctg gaggattttc catctccatc tccttcatca 31416 
gttccaccat gtccaccgtg gtcatctggg acgagaacga cagttgtcat acacctcata 31476 
agtcaccggt cgatgacgaa cgtacagatc tcgaagaatg tcctgtcgcc gcctttcggc 31536 
agcactgggc cgaaggcgaa agcgcccatg tttaacaatg gccagcaccg cccgcttcat 31596 
caggcgccta gttcttttag cgcaacagcg catgcgcagc tcgctaagac tggcgcaaga 31656 
aacacagcac agaaccacca gattgttcat gatcccataa gcgtgctgac accagcccat 31716 
actaacaaat tgtttcacta ttctagcatg aatgtcatat ctgatgttca agtaaattaa 31776 
atggcgcccc cttatgtaaa cacttcccac gtacaacacc tcctttggca tctgataatt 31836 
aaccacctcc cgataccaaa tacatctctg attaatagtc gccccgtaca ctacccgatt 31896 
aaaccaagtt gccaacataa tcccccctgc catacactgc aaagaacctg gacggctaca 31956 
atgacagtgc aaagtccaca cctcgttgcc atggataact gaggaacgcc ttaagtcaat 32016 
agtggcacaa ctaatacaaa catgtaaata gtgtttcaac aagtgccact cgtatgaggt 32076 
gagtatcatg tcccagggaa cgggccactc cataaacact gcaaaaccaa cacatcctac 32136 
catcccccgc acggcactca catcgtgcat ggtgttcata tcacagtccg gaagctgagg 32196 
acaaggaaaa gtctcgggag cattttcata gggcggtagt gggtactcct tgtaggggtt 32256 
cagtcggcac cggtatctcc tcaccttctg ggccataaca cacaagttga gatctgattt 32316 
caaggtactt tctgaatgaa aaccaagtgc tttcccaaca atgtatccga tgtcttcggt 32376 
ccccgcgtcg gtagcgctcc ttgcagtaca cacggaacaa ccactcacgc aggcccagaa 32436 
gacagttttc cgcggacggt gacaagttaa tccccctcag tctcagagcc aatatagttt 32496 
cttccacagt agcataggcc aaacccaacc aggaaacaca agctggcacg tcccgttcaa 32556 
cgggaggaca aggaagcaga ggcagaggca taggcaaagc aacagaattt ttattccaac 32616 
tggtcacgta gcacttcaaa caccaggtca cgtaaatggc agcgatcttg ggtttcctga 32676 
tggaacataa cagcaagatc aaacatgaga cgattctcaa ggtgattaac cacagctgga 32736 
attaaatcct ccacgcgcac atttagaaac accagcaata caaaagcccg gttttctccg 32796 
ggatctatca tagcagcaca gtcatcaatt agtcccaagt aattttcccg tttccaatct 32856 
gttataattt gcagaataat gccctgtaaa tccaagccgg ccatggcgaa aagctcagat 32916 
aatgcacttt ccacgtgcat tcgtaaacac accctcatct tgtcaatcca aaaagtcttc 32976 
ttcttgagaa acctgtagta aattaagaat cgccaggtta ggctcgatgc ctacatcccg 33036 
gagcttcatt ctcagcatgc actgcaaatg atccagcaga tcagaacagc aattagcagc 33096 
cagctcatcc ccggtttcca gttccggagt tcccacggca attatcactc gaaacgtggg 33156 
acaaatcgaa ataacatgag ctcccacgtg agcaaaagcc gtagggccag tgcaataatc 33216 
acagaaccag cggaaaaaag attgcagctc atgtttcaaa aagctctgca gatcaaaatt 33276 
cagctcatgc aaataacaca gtaaagtttg cggtatagta accgaaaacc acacgggtcg 33336 
acgttcaaac atctcggctt acctaaaaaa gaagcacatt tttaaaccac agtcgcttcc 33396 
tgaacaggag gaaatatggt gcggcgtaaa accagacgcg ccaccggatc tccggcagag 33456 
ccctgataat acagccagct gtggttaaac agcaaaacct ttaattcggc aacggttgag 33516 
gtctccacat aatcagcgcc cacaaaaatc ccatctcgaa cttgctcgcg tagggagcta 33576 
aaatggccag tatagcccca tggcacccga acgctaatct gcaagtatat gagagccacc 33636 

ccattcggcg ggatcacaaa atcagtcgga gaaaacaacg tatacacccc ggactgcaaa 33696 

agctgttcag gcaaacgccc ctgcggtccc tctcggtaca ccagcaaagc ctcgggtaaa 33756 

gcagccatgc caagcgctta ccgtgccaag agcgactcag acgaaaaagt gtactgaggc 33816 

gctcagagca gcggctatat actctacctg tgacgtcaag aaccgaaagt caaaagttca 33876 

cccggcgcgc ccgaaaaaac ccgcgaaaat ccacccaaaa agcccgcgaa aaacacttcc 33936 

gtataaaatt tccgggttac cggcgcgtca ccgccgcgcg acacgcccgc cccgccccgc 33996 

gctcctcccc gaaacccgcc gcgcccactt ccgcgttccc aagacaaagg tcgcgtaact 34056 

ccgcccacct catttgcatg ttaactcggt cgccatcttg cggtgttata ttgatgatg 34115 

<210> 35 
<211> 503 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 35 

Met Arg Arg Ala Val Ala Val Pro Ser Ala Ala Met Ala Leu Gly Pro 
1 5 10 15 

Pro Pro Ser Tyr Glu Ser Val Met Ala Ala Ala Thr Leu Gln Ala Pro 
20 25 30 

Leu Glu Asn Pro Tyr Val Pro Pro Arg Tyr Leu Glu Pro Thr Gly Gly 
35 40 45 

Arg Asn Ser Ile Arg Tyr Ser Glu Leu Thr Pro Leu Tyr Asp Thr Thr 
50 55 60 

Arg Leu Tyr Leu Val Asp Asn Lys Ser Ala Asp Ile Ala Thr Leu Asn 
65 70 75 80 

Tyr Gln Asn Asp His Ser Asn Phe Leu Thr Ser Val Val Gln Asn Ser 
85 90 95 

Asp Tyr Thr Pro Ala Glu Ala Ser Thr Gln Thr Ile Asn Leu Asp Asp 
100 105 110 

Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr Ile Leu His Thr Asn Met 
115 120 125 

Pro Asn Val Asn Glu Phe Met Phe Thr Asn Ser Phe Arg Ala Lys Leu 
130 135 140 

Met Val Ala His Glu Ala Asp Lys Asp Pro Val Tyr Glu Trp Val Gln 
145 150 155 160 

Leu Thr Leu Pro Glu Gly Asn Phe Ser Glu Ile Met Thr Ile Asp Leu 
165 170 175 

Met Asn Asn Ala Ile Ile Asp His Tyr Leu Ala Val Ala Arg Gln Gln 
180 185 190 

Gly Val Lys Glu Ser Glu Ile Gly Val Lys Phe Asp Thr Arg Asn Phe 
195 200 205 

Arg Leu Gly Trp Asp Pro Glu Thr Gly Leu Val Met Pro Gly Val Tyr 
210 215 220 

Thr Asn Glu Ala Phe His Pro Asp Val Val Leu Leu Pro Gly Cys Gly 
225 230 235 240 

Val Asp Phe Thr Tyr Ser Arg Leu Asn Asn Leu Leu Gly Ile Arg Lys 
245 250 255 

Arg Met Pro Phe Gln Glu Gly Phe Gln Ile Leu Tyr Glu Asp Leu Glu 
260 265 270 

Gly Gly Asn Ile Pro Ala Leu Leu Asp Val Pro Ala Tyr Glu Glu Ser 
275 280 285 

Ile Ala Asn Ala Arg Glu Ala Ala Ile Arg Gly Asp Asn Phe Ala Ala 
290 295 300 

Gln Pro Gln Ala Ala Pro Thr Ile Lys Pro Val Leu Glu Asp Ser Lys 
305 310 315 320 

Gly Arg Ser Tyr Asn Val Ile Ala Asn Thr Asn Asn Thr Ala Tyr Arg 
325 330 335 

Ser Trp Tyr Leu Ala Tyr Asn Tyr Gly Asp Pro Glu Lys Gly Val Arg 
340 345 350 

Ala Trp Thr Leu Leu Thr Thr Pro Asp Val Thr Cys Gly Ser Glu Gln 
355 360 365 

Val Tyr Trp Ser Leu Pro Asp Met Tyr Val Asp Pro Val Thr Phe Arg 
370 375 380 

Ser Thr Gln Gln Val Ser Asn Tyr Pro Val Val Gly Ala Glu Leu Met 
385 390 395 400 

Pro Ile His Ser Lys Ser Phe Tyr Asn Glu Gln Ala Val Tyr Ser Gln 
405 410 415 

Leu Ile Arg Gln Thr Thr Ala Leu Thr His Val Phe Asn Arg Phe Pro 
420 425 430 

Glu Asn Gln Ile Leu Val Arg Pro Pro Ala Pro Thr Ile Thr Thr Val 
435 440 445 

Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr Leu Pro Leu Gln 
450 455 460 

Asn Ser Ile Arg Gly Val Gln Arg Val Thr Ile Thr Asp Ala Arg Arg 
465 470 475 480 

Arg Thr Cys Pro Tyr Val Tyr Lys Ala Leu Gly Ile Val Ala Pro Arg 
485 490 495 

Val Leu Ser Ser Arg Thr Phe 
500 



<210> 36 
<211> 917 
<212> PRT 

<213> simian adenovirus SV-39 
<400> 36 

Met Ala Thr Pro Ser Met Met Pro Gln Trp Ser Tyr Met His Ile Ala 
1 5 10 15 

Gly Gln Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gln Phe Ala 
20 25 30 

Arg Ala Thr Glu Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 

Thr Val Ala Pro Thr His Asp Val Thr Thr Asp Arg Ser Gln Arg Leu 
50 55 60 

Thr Ile Arg Phe Val Pro Val Asp Lys Glu Asp Thr Ala Tyr Ser Tyr 
65 70 75 80 

Lys Thr Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 

Ala Ser Thr Tyr Phe Asp Ile Arg Gly Val Ile Asp Arg Gly Pro Ser 
100 105 110 

Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 

Ala Pro Asn Asn Ser Gln Trp Asn Ala Thr Asp Asn Gly Asn Lys Pro 
130 135 140 

Val Cys Phe Ala Gln Ala Ala Phe Ile Gly Gln Ser Ile Thr Lys Asp 
145 150 155 160 

Gly Val Gln Ile Gln Asn Ser Glu Asn Gln Gln Ala Ala Ala Asp Lys 
165 170 175 

Thr Tyr Gln Pro Glu Pro Gln Ile Gly Val Ser Thr Trp Asp Thr Asn 
180 185 190 

Val Thr Ser Asn Ala Ala Gly Arg Val Leu Lys Ala Thr Thr Pro Met 
195 200 205 

Leu Pro Cys Tyr Gly Ser Tyr Ala Asn Pro Thr Asn Pro Asn Gly Gly 
210 215 220 

Gln Ala Lys Thr Glu Gly Asp Ile Ser Leu Asn Phe Phe Thr Thr Thr 
225 230 235 240 

Ala Ala Ala Asp Asn Asn Pro Lys Val Val Leu Tyr Ser Glu Asp Val 
245 250 255 

Asn Leu Gln Ala Pro Asp Thr His Leu Val Tyr Lys Pro Thr Val Gly 
260 265 270 

Glu Asn Val Ile Ala Ala Glu Ala Leu Leu Thr Gln Gln Ala Cys Pro 
275 280 285 

Asn Arg Ala Asn Tyr Ile Gly Phe Arg Asp Asn Phe Ile Gly Leu Met 
290 295 300 

Tyr Tyr Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gln Ala Ser 
305 310 315 320 

Gln Leu Asn Ala Val Val Asp Leu Gln Asp Arg Asn Thr Glu Leu Ser 
325 330 335 

Tyr Gln Leu Met Leu Asp Ala Leu Gly Asp Arg Thr Arg Tyr Phe Ser 
340 345 350 

Met Trp Asn Gln Ala Val Asp Ser Tyr Asp Pro Asp Val Arg Ile Ile 
355 360 365 

Glu Asn His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu 
370 375 380 

Pro Gly Met Gly Ile Phe Asn Ser Tyr Lys Gly Val Lys Pro Gln Asn 
385 390 395 400 

Gly Gly Asn Gly Asn Trp Glu Ala Asn Gly Asp Leu Ser Asn Ala Asn 
405 410 415 

Glu Ile Ala Leu Gly Asn Ile Phe Ala Met Glu Ile Asn Leu His Ala 
420 425 430 

Asn Leu Trp Arg Ser Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro 
435 440 445 

Asp Ser Tyr Lys Phe Thr Pro Ala Asn Ile Thr Leu Pro Ala Asn Gln 
450 455 460 

Asn Thr Tyr Glu Tyr Ile Asn Gly Arg Val Thr Ser Pro Thr Leu Val 
465 470 475 480 

Asp Thr Phe Val Asn Ile Gly Ala Arg Trp Ser Pro Asp Pro Met Asp 
485 490 495 

Asn Val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg 
500 505 510 

Ser Met Leu Leu Gly Asn Gly Arg Val Val Pro Phe His Ile Gln Val 
515 520 525 

Pro Gln Lys Phe Phe Ala Ile Lys Asn Leu Leu Leu Leu Pro Gly Ser 
530 535 540 

Tyr Thr Tyr Glu Trp Ser Phe Arg Lys Asp Val Asn Met Ile Leu Gln 
545 550 555 560 

Ser Thr Leu Gly Asn Asp Leu Arg Val Asp Gly Ala Ser Val Arg Ile 
565 570 575 

Asp Ser Val Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr 
580 585 590 

Ala Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gln Ser 
595 600 605 

Phe Asn Asp Tyr Leu Ser Ala Ala Asn Met Leu Tyr Pro Ile Pro Ala 
610 615 620 

Asn Ala Thr Asn Val Pro Ile Ser Ile Pro Ser Arg Asn Trp Ala Ala 
625 630 635 640 

Phe Arg Gly Trp Ser Phe Thr Arg Leu Lys Ala Lys Glu Thr Pro Ser 
645 650 655 

Leu Gly Ser Gly Phe Asp Pro Tyr Phe Val Tyr Ser Gly Thr Ile Pro 
660 665 670 

Tyr Leu Asp Gly Ser Phe Tyr Leu Asn His Thr Phe Lys Arg Leu Ser 
675 680 685 

Ile Met Phe Asp Ser Ser Val Ser Trp Pro Gly Asn Asp Arg Leu Leu 
690 695 700 

Thr Pro Asn Glu Phe Glu Ile Lys Arg Ile Val Asp Gly Glu Gly Tyr 
705 710 715 720 

Asn Val Ala Gln Ser Asn Met Thr Lys Asp Trp Phe Leu Ile Gln Met 
725 730 735 

Leu Ser His Tyr Asn Ile Gly Tyr Gln Gly Phe Tyr Val Pro Glu Gly 
740 745 750 

Tyr Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gln Pro Met Ser 
755 760 765 

Arg Gln Val Pro Asp Pro Thr Ala Ala Gly Tyr Gln Ala Val Pro Leu 
770 775 780 

Pro Arg Gln His Asn Asn Ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
785 790 795 800 

Met Arg Glu Gly Gln Pro Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu Ile 
805 810 815 

Gly Ala Thr Ala Val Pro Ala Ile Thr Gln Lys Lys Phe Leu Cys Asp 
820 825 830 

Arg Val Met Trp Arg Ile Pro Phe Ser Ser Asn Phe Met Ser Met Gly 
835 840 845 

Ala Leu Thr Asp Leu Gly Gln Asn Met Leu Tyr Ala Asn Ser Ala His 
850 855 860 

Ala Leu Asp Met Thr Phe Glu Val Asp Pro Met Asn Glu Pro Thr Leu 
865 870 875 880 

Leu Tyr Met Leu Phe Glu Val Phe Asp Val Val Arg Val His Gln Pro 
885 890 895 

His Arg Gly Ile Ile Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 
900 905 910 

Gly Asn Ala Thr Thr 
915 



<210> 37 

<211> 533 

<212> PRT 

<213> simian adenovirus SV-39 

<400> 37 

Met Lys Arg Ala Lys Val Glu Glu Gly Phe Asn Pro Val Tyr Pro Tyr 
1 5 10 15 

Gly Tyr Ser Thr Pro Thr Asp Val Ala Pro Pro Phe Val Ala Ser Asp 
20 25 30 

Gly Leu Gln Glu Asn Pro Pro Gly Val Leu Ser Leu Lys Ile Ser Lys 
35 40 45 

Pro Leu Thr Phe Asn Ala Ser Lys Ala Leu Ser Leu Ala Ile Gly Pro 
50 55 60 

Gly Leu Lys Ile Gln Asp Gly Lys Leu Val Gly Glu Gly Gln Ala Ile 
65 70 75 80 

Leu Ala Asn Leu Pro Leu Gln Ile Thr Asn Asn Thr Ile Ser Leu Arg 
85 90 95 

Phe Gly Asn Thr Leu Ala Leu Asn Asp Asn Asn Glu Leu Gln Thr Thr 
100 105 110 

Leu Lys Ser Ser Ser Pro Leu Lys Ile Thr Asp Gln Thr Leu Ser Leu 
115 120 125 

Asn Ile Gly Asp Ser Leu Ala Ile Lys Asp Asp Lys Leu Glu Ser Ala 
130 135 140 

Leu Gln Ala Thr Leu Pro Leu Ser Ile Ser Asn Asn Thr Ile Ser Leu 
145 150 155 160 

Asn Val Gly Thr Gly Leu Thr Ile Asn Gly Asn Val Leu Gln Ala Val 
165 170 175 

Pro Leu Asn Ala Leu Ser Pro Leu Thr Ile Ser Asn Asn Asn Ile Ser 
180 185 190 

Leu Arg Tyr Gly Ser Ser Leu Thr Val Leu Asn Asn Glu Leu Gln Ser 
195 200 205 

Asn Leu Thr Val His Ser Pro Leu Lys Leu Asn Ser Asn Asn Ser Ile 
210 215 220 

Ser Leu Asn Thr Leu Ser Pro Phe Arg Ile Glu Asn Gly Phe Leu Thr 
225 230 235 240 

Leu Tyr Leu Gly Thr Lys Ser Gly Leu Leu Val Gln Asn Ser Gly Leu 
245 250 255 

Lys Val Gln Ala Gly Tyr Gly Leu Gln Val Thr Asp Thr Asn Ala Leu 
260 265 270 

Thr Leu Arg Tyr Leu Ala Pro Leu Thr Ile Pro Asp Ser Gly Ser Glu 
275 280 285 

Gln Gly Ile Leu Lys Val Asn Thr Gly Gln Gly Leu Ser Val Asn Gln 
290 295 300 

Ala Gly Ala Leu Glu Thr Ser Leu Gly Gly Gly Leu Lys Tyr Ala Asp 
305 310 315 320 

Asn Lys Ile Thr Phe Asp Thr Gly Asn Gly Leu Thr Leu Ser Glu Asn 
325 330 335 

Lys Leu Ala Val Ala Ala Gly Ser Gly Leu Thr Phe Arg Asp Gly Ala 
340 345 350 

Leu Val Ala Thr Gly Thr Ala Phe Thr Gln Thr Leu Trp Thr Thr Ala 
355 360 365 

Asp Pro Ser Pro Asn Cys Thr Ile Ile Gln Asp Arg Asp Thr Lys Phe 
370 375 380 

Thr Leu Ala Leu Thr Ile Ser Gly Ser Gln Val Leu Gly Thr Val Ser 
385 390 395 400 

Ile Ile Gly Val Lys Gly Pro Leu Ser Ser Ser Ile Pro Ser Ala Thr 
405 410 415 

Val Thr Val Gln Leu Asn Phe Asp Ser Asn Gly Ala Leu Leu Ser Ser 
420 425 430 

Ser Ser Leu Lys Gly Tyr Trp Gly Tyr Arg Gln Gly Pro Ser Ile Asp 
435 440 445 

Pro Tyr Pro Ile Ile Asn Ala Leu Asn Phe Met Pro Asn Ser Leu Ala 
450 455 460 

Tyr Pro Pro Gly Gln Glu Ile Gln Ala Lys Cys Asn Met Tyr Val Ser 
465 470 475 480 

Thr Phe Leu Arg Gly Asn Pro Gln Arg Pro Ile Val Leu Asn Ile Thr 
485 490 495 

Phe Asn Asn Gln Thr Ser Gly Phe Ser Ile Arg Phe Thr Trp Thr Asn 
500 505 510 

Leu Thr Thr Gly Glu Ala Phe Ala Met Pro Pro Cys Thr Phe Ser Tyr 
515 520 525 

Ile Ala Glu Gln Gln 
530 

<210> 38 

<211> 50 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligomer SV25T 

<400> 38 

aatttaaata cgtagcgcac tagtcgcgct aagcgcggat atcatttaaa 50 



<210> 39 

<211> 49 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligomer SV25B 

<400> 39 

tatttaaatg atatccgcgc ttaagcgcga ctagtgcgct acgtattta 49 